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Preface to the Second Edition 


We have been pleased with the success of our book and its adoption 
as a working text by first year students in many engineering courses at 
Universities and higher technological Institutes. Following our own 
ideas, but also in response to requests, we have, in this edition, included 
new material. In Chapter 9 we have added two new sections, one on 
eigenvalues and eigenvectors and the other an introduction to the idea 
of the gradient of a scalar field. Both these topics are often met by 
engineering students during the first or second year courses in their 
own engineering disciplines. In Chapter 11a new section on isoclines 
has been added and the ideas of the numerical integration of differ¬ 
ential equations form a new section in Chapter 13. Two other new 
sections of Chapter 13 give an elementary introduction to error analysis. 
Now that all students have electronic pocket calculators it seems to us 
important that the concept of accuracy, propagation of error, and error 
bounds should be dealt with early in the curriculum. Additional 
examples have been added to the relevant Exercises. 

This edition also contains a completely new Chapter 17 on Applica¬ 
tions. The teacher of mathematics is often asked by engineering students 
to explain why it is that they have to be taught topics such as eigen¬ 
values, vectors, complex numbers, partial differentiation. Chapter 17 
goes some little way towards showing students that these topics (and 
others) have application in the physical and industrial world. We would 
have liked this chapter to be much larger and to have covered more 
topics but are necessarily restricted by limitation on the size (and price) 
of the book. We will be satisfied if the student finds something of 
interest which will induce further study of mathematics and if the 
teachers of mathematics to engineers can be encouraged to investigate 
further applications themselves. One of us (SCL) is indebted to his 
colleagues at Newcastle upon Tyne for much discussion on the appli¬ 
cations but we both accept responsibility for any mistakes that have 
appeared. We would like to thank all those readers who have written 
to us with suggestions and also those who have indicated errors in the 
first edition. We hope we have incorporated all corrections and ideas in 
this edition. We would be pleased to have any further comments. 

Finally our thanks are due to our publishers who have allowed us 
to amend the text as we wished and have made an excellent job of 
putting together the new edition and have tried to keep the costing to 
a minimum. 

Stanley C. Lennox 
1977 


Mary Chadwick 



Preface to the First Edition 


This book is the outcome of many years of teaching mathematics to 
engineering students particularly in the University of Newcastle upon 
Tyne. It is intended to cover the mathematical content of the first year 
to eighteen months of a three year course in a University or Institute 
of higher technological education. For certain disciplines the contents 
will meet the complete mathematical requirements of the entire three 
year course while for others the contents cover the basic requirements 
before proceeding to more advanced mathematical topics. It covers the 
mathematical syllabus for the Part I examination of the Council of 
Engineering Institutions (CEI). We have assumed a knowledge roughly 
equivalent to a pass in A-level single subject Pure and Applied Mathe¬ 
matics, a Higher National Certificate, or an equivalent qualification. 

Although primarily written for engineering students the text should 
also be of value to science students whose requirements demand a 
manipulative skill in mathematical processes. Where necessary a 
motivation, usually within an engineering context, is used to introduce 
the reader to a new concept before a proof is introduced. Where such 
a proof is omitted it is simply because we believe that its introduction, 
at that point, would unduly delay the development of the subject and 
may well confuse the student. We do not subscribe to the view that the 
engineer or non-mathematical student should never be exposed to 
mathematical rigour but think that at this particular level it is more 
important that he should be able to use and appreciate the place 
of mathematics in his own subject, and to this end it is necessary that 
the student possesses manipulative skill as well as a broad understanding. 
To give an analogy it is not necessary to have a complete and detailed 
knowledge of the workings of an automobile before attempting to 
drive. Indeed an attempt to teach such detail to some would-be drivers 
may well lead them to abandon the whole process in despair. 

In writing the book we have not made any distinction between 
different kinds of engineering student but consider, no matter what 
discipline, that all should have a common mathematical core syllabus. 
It is now accepted that all engineers should be exposed to the mathe¬ 
matical ideas of probability, statistics, and numerical analysis and an 
important feature of this book is the introduction of these subjects 
alongside the more common curricula subjects of analysis and algebra. 
Chapters 1 to 7 cover analysis, including functions, differential and 
integral calculus, convergence, and Taylor series. Chapters 8 to 11 deal 
with the algebra of complex numbers, linear algebra, elementary vector 
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analysis, and ordinary differential equations. Chapters 12 to 14 are 
devoted to the analytical and numerical solutions of algebraic and 
transcendental equations, the numerical solution of a set of linear 
equations, an introduction to finite differences and polynomial inter¬ 
polation, and numerical differentiation and integration. Chapters 15 
and 16 provide a basic statistics course covering distributions, proba¬ 
bility, sampling, and simple tests of significance. 

The book is, however, presented as a whole and it is not intended 
that branches of mathematics be put into different compartments. For 
convenience it is divided into chapters but that does not mean that any 
series of lectures taken from the book need follow the chapter order. 
Certain chapters obviously demand knowledge of the work covered 
previously, for instance it would be difficult to understand Taylor’s 
theorem in Chapter 5 without having read the previous four chapters. 
On the other hand the two chapters on statistics demand a minimum 
of prior knowledge while the linear algebra of Chapter 8 and the vector 
analysis of Chapter 9 may be started after reading Chapter 1. The 
numerical work of Chapters 12, 13, and 14 can be read in isolation 
providing that differentiation and integration is covered before reaching 
the end of Chapter 14. The sketch gives a visual indication of the 
chapters covered by four possible series of lectures and shows the 
linkage points between the series. 



This book can be used as working text and to encourage both under¬ 
standing and skill a large number of examples is given and solved 
within the text. In addition an exercise is presented at the end of each 
chapter. Every example set has been provided with an answer and hints 
are given when it is considered necessary. Most of the examples are 
original but some have been taken from those collected and amended 
over many years of teaching and examining for the Universities of 
Newcastle upon Tyne, Durham, Edinburgh, Glasgow, and London 
together with the Heriot-Watt and Strathclyde Universities. The 
answers to the exercises are collected together at the end of the book 
and while they have all been checked we will be amazed if some are 
not in error. We would be grateful to any reader who is prepared to 
indicate the point of error for future amendment. 

It is inevitable that certain items have had to be omitted and others 
curtailed but a compromise has to be made between various factors 
such as the size of the book, the time devoted to mathematical studies, 
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and the extent of the syllabus. Throughout, however, it has been our 
aim to achieve a satisfactory balance between these factors. 

Finally, we are indebted first to Professor D. V. Lindley and Dr. 
J. C. P. Miller for permission to reproduce extracts from their book 
of Elementary Statistical Tables, and second to Professor L. Maunder 
both for encouraging us to write the book and for providing us with 
many helpful and constructive criticisms. 

Stanley C. Lennox Mary Chadwick 
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Introduction 


1.1 FUNCTIONS OF ONE VARIABLE 

When two quantities are so related that one of them is determined 
uniquely when the other is known, then the first quantity is a function 
of the second quantity. That one quantity is a simple function of another 
does not necessarily mean that there is a formula connecting them. A 
function can be expressed as a table of values, in the form of a graph, 
as a formula, and in other ways. 

1.1.1 Notation and definitions 

When two variables x and y are related so that y is determined uniquely 
when x is known, the functional relationship is written y = f(x) where 
f(x) is the rule prescribing the value of y for any given value of x. Other 
notations such as / = g(x), y — </>(x), y — F(x), </>(x, y) = 0 will be 
used, and y = y(x), x — x(t) are used to mean that y is a function of 
x and x is a function of t. 


(b) 


(c) aC*- J —- *>» 

Fig. 1.1 

Suppose y = f(x), then the variable x is called the argument or 
independent variable and f{x) or y the dependent variable or value of 
/ for the argument x. Values of y = f{x) are found for any given value 
of x by replacing x in fix) by the given value. Thus the value of fix) 
when x = 2 is denoted by/(2). 

The set of numbers over which x may vary is called the range of x 
and is usually the set of points in an interval on the x-axis. This interval 
may be the whole line from — oo to + oo. The set of all real numbers 
x between two fixed numbers a and b, written a < x < b means that 
x may take any value which is simultaneously greater than a and less 
than b. This set of numbers is called an open interval. See Fig. 1.1(a). 
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The correct description of a mathematical real number requires more 
precise detail than is given here. It is sufficient to recognize that a real 
number is either a rational number of the form plq, where p and q are 
positive or negative integers and q ^ 0, e.g. 0, |, —1, +2, — f; or an 
irrational number that cannot be expressed in the form pjq, e.g. \/2, 
77, To each and every point of the x-axis there corresponds 

a rational or irrational number x, called simply a real number. 

The set a < x < b of real numbers x consists of all numbers in the 
open interval plus the left end point x — a. See Fig. 1.1(b). The set 
a < x < b is the set of all real numbers between a and b including 
both end points. This is called a closed interval. See Fig. 1.1(c). If x 
may take all values in a range it is called a continuous variable. 

The functional relationship y = /(x) now determines the set of values 
y may take. In general the function may be represented graphically by 
those points whose coordinates (x, y) satisfy y = /(x) for all those 
values which x may take. 

Example (i) y = x/(x 2 + 1). 

Here x may be any real number and the graph of the function is 
sketched in Fig. 1.2. 



Example (ii) y = y' (x — 1). 

The range of values of x consists in those values for which the function 
has a meaning, so that in this case, x > 1. See Fig. 1.3. 


Example (iii) Let /(x) be the sum of x tends of the geometric series 
1 +* + * + * + •■ • 

Graphically y = f(x) is represented by a set of points which may not 
be joined by a continuous line. Values of x are positive integers only, 
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and the value of the function for non-integral values of x is meaningless. 
The successive sums /(l),/(2),/(3), ... are illustrated in Fig. 1.4. 



Fig. 1.4 

Example (iv) It is possible to define a function by different formulae for 
different values of x, as 



Fig- 1.5 
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This function is defined for all values of x > 1 except for x = 5. Its 
graph is sketched in Fig. 1.5. 

A most important property of the function/(x) is that to each value 
of x in the range there corresponds one and only one value of f Thus 
the equation y 2 = x represents the two functions y = \/x and 
y = — y/x. The expression y/x will always mean the positive square 
root of x when x is real and positive. 


Example (v) The relationship y = y/x 2 defines y as a single-valued 
function of x for all values of x. It is also called the absolute value of 
x or modulus of x or mod x, written |x|. Thus 


V(x) 2 


lx if x > 0 
i-x if x < 0. 


Geometrically, the function |x| is the distance from the origin O to the 
point P which represents the number x on the scale of real numbers 
regardless of whether x is positive or negative. This is shown in Fig. 
1.6(a) where the distance OP t is |xj| and the distance OP 2 is |x a |. 

-«-p, 

OC\ X 


(a) 


(b) 

Fig. 1.6 

The modulus symbol may be used to express a range of values of x, 
for example the range — 1 < x < 1 is equivalent to |x| < 1. 

Also, \x — a\ <2 means that x is within two units of a. This means 
that x may vary between a — 2 and a + 2, so that |x — a\ <2 is 
the same as a — 2 < x < o + 2. This is shown in Fig. 1.6(b). 

1.1.2 Limits, continuity 

Consider a given function/(x) at some particular value of x = a. As 
x approaches the value a, the value of/(x) will usually change steadily 
and approach the value f{a), and it is possible to choose a value of /(x) 
‘as near as we please’ to the value f(a) by choosing x sufficiently near 
to the value a. A different way of saying this is that /(x) tends to f(a ) 
as x tends to a. This is written in symbols, 

/(*)-’■/(*) as x-*a. 



INTRODUCTION 


5 


The value of fia) is the limit of fix) as x -* a, or 
Lim/(x) =f(a). 

If x increases towards the value a, or x -* a from the left, the notation 
fix) ->/(a - 0) as x -> a— 


Lim fix) —fia - 0) 

is used. Similarly if x decreases towards the value a, or x approaches a 
from the right, 

fix) -* f{a + 0) as x-*a+ 


or 

Lim fix) — fia + 0). 

Often it does not matter whether x approaches the fixed value a 
from the left or from the right since in both cases the limit will be f(a), 
the value of the function at x = a. For example, consider the graph in 
Fig. 1.5. If the value of a is between 1 and 5 or greater than 5, 

/(a - 0) =/(a) =fia + 0). (1) 

When equations (1) are satisfied, the function is said to be continuous 
at the point x — a. For continuity at the point x = a it is not sufficient 
that the limits of fix) as x tends to a from the left and right should 
both exist and be equal, for the common value may not be fia). For 
example, consider the function defined by 

fix) = 0 when x ^ 0, fix) = 1 when x = 0. 

Then 

Lim f{x) = Lim fix) = 0, 

but this limit is not equal to /(0) which is 1, and the function is not 
continuous at x = 0. 

Consider the function sketched in Fig. 1.5. For this function, 

Lim fix) = 2, Lim fix) = 5, 

i.e. equations (1) are not satisfied at the point x — 5 and the function 
is not continuous at this point. There is a ‘jump’ in the value of the 
function as x increases through the value 5. 

Consider also the way different functions behave when the argument 
x increases indefinitely. If the value of the function increases indefinitely 
the function is said to ‘tend to infinity’ as x tends to infinity, i.e. 
fix)-* oo as x —* oo. 
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Example (i) Consider the function /(x) = x 2 . For all values of x this 
function is continuous and as x —» ± oo, f(x) —*■ oo. The graph of 
y = x 2 is sketched in Fig. 1.7. 

The graphs of y = x 2n where n is a positive integer are similar to that 
of y = x 2 . 


Example (ii) Consider the function /(x) = x 3 . This function is continu¬ 
ous for all values of x and as x -*■ + oo, x 3 -> + oo, as x -► — oo, 
x 3 -> — oo. The graph of y = x 3 is sketched in Fig. 1.8. 

The graphs of y = x 2n + 1 for positive integers n are similar to that of 
y = x 3 . 



Example (iii) The function /(x) = 1/x is defined and continuous for 
all values of x except x = 0. This example demonstrates two properties 
not mentioned so far; 

(a) as x -*■ 0+, /(x) -+ + oo, 

as x -> 0—, /(x) — oo, 

i.e. the ‘jump’ in the value of y — /(x) as x increases through x = 0 is 
infinite; 

(b) as x -> oo, 1/x ->• 0, yet zero is not a value of/(x), i.e. the limit of 
a function is not necessarily a value of the function, y — 1/x is never 
zero. 

Similarly the function /(x) = 1/x 2 is defined and continuous for all 
values of x except x = 0. For this function, /(x) -+ +oo as x-> 0+ 
and asx^- 0—. See Fig. 1.9 and 1.10. 
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Example (iv) Consider the function f(x) = [a] where [a] is defined as 
the integral part of a. Thus [3-2] = 3, [—4-2] = — 5 and when a is 
an integer, [a] = a. 

This is an example of a step function and at each integer value of a 
there is a point where the graph is not continuous. This function is 
illustrated in Fig. 1.11. 



Fig. 1.11 


Example (v) Consider the function /(a) defined by 




when 0 < a < 1 
a when 1 < a < 2. 



Fig. 1.12 
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Here y = fix) is defined only over the interval 0 < x < 2 and 
/(I - 0) =/( 1) =/( 1 + 0) = 1. 

Hence equations (1) are satisfied and the function is continuous at the 
point x = 1. See Fig. 1.12. 

A function which is continuous at all points in a particular range is 
a continuous function in that range. Examples (i) and (ii) are examples 
of functions which are continuous for all values of x. The functions 
1/x, l/x 2 of Example (iii) are continuous for all values of x except 
x = 0; the function defined in Example (v) is continuous for all values 
of x for which the function is defined, i.e. in 0 < x < 2; the function 
[x] of Example (iv) is not continuous at any integer value of x. 


1.1.3 Odd and even functions 

The function fix) is an even function if/(—x) = fix) for all values of x. 
The graph of an even function is symmetrical about the j-axis, as x 2 , 
l/x 2 , cos x. 

The function fix) is an odd function if /(—x) = —fix) for all values 
of x. The graph of an odd function is symmetrical about the origin, 
as x 3 , l/x, sin x. 

Any function can be written as the sum of an odd and an even 
function, for 


/(*) = «/(*) +/(-*)} + \{fix) -fi-x)}. 

1.1.4 Monotonic functions 

The function fix) is a monotonically increasing function, or /(x) increases 
monotonically in an interval of values of x if/(x) increases steadily as 
x increases through the interval, i.e. if/(x 2 ) > /(x x ) whenever x 2 > Xi. 

Similarly fix) is a monotonically decreasing function if /(x) decreases 
steadily as x increases through an interval, i.e. if/(x 2 ) </(x x ) whenever 
x 2 > x x . 


1.2 INVERSE FUNCTIONS 

Suppose y = fix) is a continuous function of x, then it is sometimes 
possible to express x as a function of y. For example, the formula 
y = mx + c determines y uniquely when x is given, so that y is a 
function of x from the definition of function given in section 1.1. Also 
x — iy — c)/m is a function of y. 

Consider the function given by j = x 2 ; y is determined uniquely 
when x is given, for all values of x. If x is expressed in terms of y there 
are two possibilities, x = ± \/y and x is not uniquely determined when 
y is given. Hence from the definition of a function, x cannot be expressed 
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as a function of y but is expressed as two functions of y for all positive 
values of y. 

If a given function y — f(x) can be replaced by a function x = <f>(y), 
then x = <f>(y) is called the inverse function of y — f{x). For this to be 
possible, to each value of x there is one and only one corresponding 
value of y and to each value of y there is one and only one corresponding 
value of x. 

Example The function y = x 2 , —2 < x < 2, has no unique inverse 
function since x = ± \/y gives two functions in the range. 

The function y = x 2 , 0 < x < 2, has the unique inverse function 
x = \/y. These functions are sketched in Fig. 1.13 and 1.14. 





Suppose y = fix) is defined in the range a < x < ft and has a graph 
as in Fig. 1.15. For each value of x in the range y is uniquely determined 
but to each value of y there may be more than one corresponding 
value of x. Hence this function cannot be inverted in the whole range 
a < a < b. 
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In the range a < x < c, x is uniquely determined when y is given, 
fix) is monotonically increasing and hence a: can be expressed as a 
function of y in a < x < c. Similarly x can be expressed as a (different) 
function of y in the range c < x < d where the function is mono¬ 
tonically decreasing and in the range d <x <b where the function 
is monotonically increasing. 

More generally if y = f(x) is continuous and monotonic in an 
interval then it is possible to define the inverse function x = <f>(y). 
Consider the graphs of y = f(x) and x = <f>(y) where y = f{x) is con¬ 
tinuous and monotonically increasing in a < x < b, and a < y < /J. 
These graphs are sketched in Figs. 1.16 (a) and (b). 



Geometrically the graph of the inverse function is drawn by reflecting 
the graph of y = /(x) in the line y = x, or by ‘rotating’ the axes of 
the graph of y = /(x) through 90° and reflecting in the new x-axis. 


1.2.1 The inverse circular functions 

The function y = sin" 1 x or y = arc sin x is defined as the value of y 
such that 

x = sin 7 and — (2) 

That is the function x = sinj> is inverted only in that range where 
there is a one to one correspondence between values of x and values 
of y. There are many ranges for y where this is possible. The range 
chosen in equation (2) is that one which includes y = 0. 

Note that if x = sin y there are many values of y which satisfy the 
equation for a given value of x. For example if sin y = then 
y = rm + (—OW, n = 0, ±1, ±2, .... But sin _1 (|) = \-tt. Figure 
1.17 shows sketch graphs of x = sin y and y — sin -1 x. 

Similarly the expression x = cos y is many-valued and to obtain 
the full range of values of cos once and once only, the range is 
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(b) 


restricted to 0 < y < v where the function cos y is monotonically 
decreasing, i.e. y — cos " 1 x or y = arccos x is defined to be that value 
of y such that 

x = cos_y and (3) 

Figure 1.18 shows the graphs of 7 = cos -1 x and x — cos y. 



The expression x = tan y is many-valued but takes all values once 
only in the range — \it < y ^ \n where it is monotonically increasing. 
Hence the inverse function y = tan -1 x or y = arctan x means 
x = tan j' and —^jt < y < 

Figure 1.19 shows the graph of y — tan -1 x. 


(4) 
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The difference between sin -1 * and (sin x)“ 1 = cosec x should be 
noted. 


Example Prove that tan" 1 2 + tan" 1 3 = fw. Is it true that tan” 1 2 + 
tan -1 3 = tan _1 (—1)? 

Solution Let tan" 1 2 = a, tan" 1 3 = p, then 


tan (a + p) = 


tan a + tan ft _ 2 + 3 
1 — tan a tan p ~ 1 — 6 


-1. 


(i) 


Now by definition of the inverse tangent, — fn- < a < and 
— far < p < \tt. Moreover since tan a > 1 and tan /? > 1 both a and 
P are greater than i.e. \tt < a < \n and \tt < ft <\tt hence 
< a. + p < n, and this with equation (i) means a -f /? = f-n-. 

By definition, tan _1 (— 1) = — fa so that tan" 1 2 + tan -1 3 ^ 
tan" 1 (—1). 


1.3 COORDINATE SYSTEMS 
1.3.1 Coordinate systems in the plane 

The position of a point P in the plane is fixed when its rectangular 
Cartesian coordinates (x, y) are known. These are coordinates referred 
to two perpendicular axes Ox, Oy through an arbitrarily chosen point 
O called the origin. The x-coordinate is called the abscissa and the 
j-coordinate the ordinate of the point. 

The position of the point P in the plane is also fixed if its distance r 
from a fixed point O in the plane and the angle 6 the line OP makes 
with a fixed straight line in the plane are known. The point O is called 
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the pole, the fixed straight line is called the initial line. The distance r 
is always positive and the angle is measured positively, or anti-clock¬ 
wise, from the initial line so that the direction is specified by the angle. 
The point P is then given the polar coordinates (r, 0). 

Polar and Cartesian coordinates are transformed one to the other if 
the origin is used as the pole and the positive x-axis is the initial fine 
by the equations 

x = r cos 0, y = r sin 0. 

These equations may also be expressed 

r = x/(x 2 + y 2 ), cos 0 = x/r, sin 0 = y/r. 

See Fig. 1.20. It is not sufficient to consider tan 0 = yjx since the 
value of 0 is not then uniquely determined. For example, if both the 
x- and ^-coordinates are negative the point is in the third quadrant, 
yet tan 0 is positive giving the point (r, 0) in either the first or the third 
quadrant. A rough sketch will often prove useful. 

Example (i) The point P whose Cartesian coordinates are (1, \/3) has 
polar coordinates (2, Jtt). 

Example (ii) The point Q whose Cartesian coordinates are (— 1, \/3) 
has polar coordinates (2, j»r). See Fig. 1.21. 




1.3.2 Coordinates in three dimensions 

(i) Cartesian coordinates A point P in three-dimensional space is 
specified by the coordinates (x, y, z) referred to three mutually perpen¬ 
dicular axes through an origin O. The set of numbers specifying the 
coordinates of P may also be written (x l5 x 2 , x 3 ). See Fig. 1.22. 

(ii) Cylindrical polar coordinates In three dimensions the point P is 
fixed when the cylindrical polar coordinates (p, <f>, z) are known, where 
z is the perpendicular distance of P from the plane (xy) and (p, <f>) are 
the polar coordinates of the projection N of P in the (xy) plane. See 
Fig. 1.23. 
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The relations connecting cylindrical polar coordinates with Cartesian 
coordinates are 


x = p cos <j>, 
y = p sin </>, 
Z =t z. 



Fig. 1.22 Cartesian coordinates 



(iii) Spherical polar coordinates The point P in three-dimensional 
space is fixed by the spherical polar coordinates ( r, 0, <f>), where r is 
the distance of P from the origin O, 6 is the angle made by OP with 
the positive z-axis and <f> is the angle made by the projection ON of 
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OP on to the (xy) plane with the positive x-axis. The transformation 
from (x, y, z) coordinates to (r, 0, <f>) coordinates is given by 

x = r sin 0 cos <j>, y = r sin 0 sin tf>, z = r cos 0. 

Note that ON = r sin 0 = p. See Fig. 1.24. 



Fig. 1.24 Spherical polar coordinates 


1.3.3 Coordinates in more than three dimensions 

The set of numbers (x, y, z) or (x lt x 2 , x 3 ) determines the position of 
a point in space referred to rectangular axes through an arbitrary origin. 
Suppose the point is specified by a fourth variable also, then the set of 
numbers (x 1( x 2 , x 3 , x 4 ) will fix the point completely. This set may be 
written more shortly as (x f ), where i = 1, 2, 3, 4. The fourth coordinate 
could, for example, be the time t. 

It may be that a set of n numbers is required to describe a point 
uniquely. Then the set of numbers (x lt x 2 , x 3 ,. . ., x n ) or (x,), 
i = 1, 2, 3,. . ., n describes the point completely. 

1.4 FUNCTIONS OF TWO VARIABLES 

When three quantities are so related that one of them is uniquely 
determined when the other two are known, the first quantity is a 
function of the other two quantities. For example the volume of a 
cylinder is a function of its radius and height given by the formula 
V — vr 2 h. The variables r and h are independent; if one depends on 
the other, then V is a function of one variable only. 

Consider the function a> = /(x, y), where x and y are independent 
variables, i.e. c o is uniquely determined when x and y are given, and is 
defined over some range R of values of (x, y). If a geometrical interpre¬ 
tation of o) = f(x, y) is required, then the number pair (x, y) may be 
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represented by a point N of rectangular coordinates (x, y) in the region 
R of the xy- plane. The value of the function co or f(x,y) is then repre¬ 
sented by a line NP drawn perpendicular to this plane at the point N 
of length equal to f(x,y). Thus as N varies over the region R the 
corresponding point co = f(x, y) varies over a surface consisting of all 
points P with Cartesian coordinates (x, y,f(x, 7 )). 

1.5 RULES FOR INEQUALITIES 

(i) If a < b and b < c then a < c. 

(ii) If a < b then a + x < b + x for every x, i.e. the inequality 
relationship is unchanged when the same number (positive or negative) 
is added to both sides. 

(iii) If a < b then pa < pb when p is positive and qa > qb when q is 
negative. In particular if a < b then —a > —b. This is easily seen if 
a and b are considered as points on the x-axis. 

-b ^ 0 t b x 

Fig. 1.25 

In Fig. 1.25 the inequality a < b means that a is to the left of b and 
—a is to the right of — b on the line in which numbers increase from 
left to right. 

For example, 3 < 4, — 3 > —4; 

6 < 8 , -3/2 > -4/2. 

Thus the inequality is unchanged if both sides are multiplied by a 
positive quantity; but the inequality is changed when both sides are 
multiplied by a negative quantity. 

(iv) If a < b, and ab is positive, then 1/a > l/b. 

For example, 3 < 4 and i > ii 
-3 < -2 and -J > -£. 


EXERCISE 1 

1 Find the values of/(2),/(— l),/(l/x),/(x + h) when/(x) = x 2 + 1/x. 

2 Find the values of/(0), f(&n), /(3n-/2) when/(x) = sin x + cos x. 

3 Solve the following equations for x in terms of y and discuss the 
possible ranges of x and y. 

X _ 1 

(0 y = (id) x 2 + xy+y 2 = 3; 

(ii) y 2 = ; (iv) y = 
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4 Investigate the continuity of the following functions at x = 0, and 
draw a rough sketch of the graph of y =fix) in each case: 

(i) fix) = |*|; (ii) f(x) = l/(x 2 + 2x); 

(0 when x < 0 
jl when x>0; 

when x < 0 
when x > 0. 


(iii) f( x ) = 


5 Show that the function x + [x] is not continuous at any integer 
value of x and draw a rough sketch of the graph of y = x + [x]. 

6 (i) Sketch the graph of y — l/(x — 1). 

(ii) Sketch the graph of y = /(x) where /(x) is defined by 
.. . _ fx — 1 when 0 < x < 2 
JW - | 3 _ x when 2 < x < 3. 


7 Show that (i) x 2 < 1 is equivalent to |x| < 1; 

(ii) if x 2 > 1 then x > 1 or x < — 1. 


8 Describe the range of the variable x without the symbol ‘modulus’ 
in each of the following: 

(i) |x| <3; (iii) |x - 2| <4; 

(ii) jx| >3; (iv) |x — 3| < 3 and at the same time 

|x-4| >2. 

9 Given that 0 = sin -1 (—£), find the values of cos 6, tan 6, sec 0, 
cosec d. 


10 Given that 6 = cos -1 (—$), find the values of sin 6, tan 0, sec 0, 
cosec 0. 

11 Evaluate (i) sin -1 (1) — sin -1 (—1); 

(ii) tan -1 (1) — tan -1 (—1); 

(iii) sec -1 (2) — sec -1 (—2). 

12 Describe and sketch the locus of points P whose rectangular 
Cartesian coordinates are (x, y, z) and which satisfy the following 
pairs of simultaneous equations: 

(i) x = constant, y = constant; (iii) x 2 + y 2 = 9, z — —3; 

(ii) y = x,z = 4; (iv) y = 0, x 2 /a 2 + z 2 /b 2 = 1. 

13 Describe and sketch the locus of points P whose cylindrical polar 
coordinates (p, <j>, z) satisfy the following pairs of simultaneous 
equations: 

(i) p = 3, z = 2; (ii) <f> = Jir, z = p. 
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14 Describe and sketch the locus of points P whose spherical polar 
coordinates (r, 9, <f>) satisfy the following pairs of simultaneous 
equations: 

(i) r = 5, <f> = J 77 ; (iii) <f> = 6 = Jtt; 

(ii) r = 4, 6 = \n; (iv) <f> = far, r = 4 cos 6. 

15 Transform each of the following equations from the given co¬ 
ordinate system into forms which are appropriate to the other two 
systems: 

(i) x 2 + y 2 + z 2 = 4; (iii) z 2 = p 2 ; 

(ii) x 2 + y 2 + z 2 = 9z; (iv) r = 4 cos 9. 



2 

Differentiation 


2.1 DEFINITION OF THE DERIVATIVE 


2.1.1 Formal definition 

Let y = f(x) be defined and continuous for all values of x in the interval 
a < x < b and let x 0 be a value of x in this interval. Then the derivative 
or differential coefficient of the function at x 0 is defined as 


>»—o " 


( 1 ) 


when this limit exists. It is denoted by/'(x 0 ). At each point x 0 where the 
limit is finite the function is said to be differentiable at the point. When 
the derivative exists over the whole range of values of x the function is 
differentiable in that range. 

The value x 0 may be any allowable value of x, consequently it is usual 
to write x instead of x 0 in equation ( 1 ) and define the derivative of/ as 


fix) 


^,/c+a-jw 

ft—o h 


( 2 ) 


remembering that x is to be held constant, while h varies and approaches 
zero. 


Example Find from first principles the derivative of 
ffx) = x 3 - 3x + 3, 

where — oo < x < oo. 

Solution f(x + h) = (x + Kf — 3(x + h) + 3 

so that f(x + h) - f(x) = 3x 2 /i + 3 xh 2 + h 3 -3h 

and w-D + tax + k). 

As h -> 0, 3x 2 — 3 is held constant while h(3x + h) -> 0 and 
f'(x) = 3x 2 - 3. 


2.1.2 Geometrical interpretation 

Let P(x 0 ,f(x 0 )) be any point on the curve y = /(x) and 
Q(x o + h,f(x 0 + h)) 
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be another point on the same curve. Then in Fig. 2.1 the slope of the 
line joining the points P and Q is QR/PR where 

QR _ f(x 0 + h) -fjx „) 

PR h 

If the value of this quotient approaches a finite limit as Q -* P along 
the curve, P being held fixed, then the slope of PQ tends to the slope 
of a line through P which is defined to be the tangent at P. Then 

ft— 0 n 

is the slope of the tangent to the curve at P, or more briefly, the slope 
of the curve at P. 



2.1.3 Notation 

It is sometimes convenient to denote an increment in the value of x 
by bx (= h), an increment of the function /by bf or of y by by, so that 

V(x 0 ) =f(x 0 + bx) -f{x 0 ), (3) 

and substituting in equation (1) 


fix 0 ) = 


¥(*<>) 

bx 


(4) 


-is(£L 


(5) 
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The derivative of y = f(x) when it exists is a function of x and is 
written f'(x) or y . From the right hand side of equations (4) and (5) is 
derived the notation 



where x 0 is replaced by the more general x. The quotient form dy/dx 
derives from equation (5) and is a limit of a quotient. It is not dy — dx. 
The symbol d/dx, regarded as an operator which when applied to /(x) 
gives fix), is sometimes written D or D x and must operate on some 
function of x. Hence Dy means dy/dx. Thus x Dy and y Dx are entirely 
different, for x Dy = x dyldx and y Dx — yd(x)jdx = y. 

The limit f'(x — 0) if it exists is called the derivative on the left, and 
similarly /'(x + 0) is called the derivative on the right. 

2.1.4 Continuity 

Continuity of the function f(x) is necessary for the existence of /'(x) 
since the limit in equation (2) cannot exist unless /(x + h) ->/(x) as 
h-*- 0, but this condition is not sufficient. For if fix — 0) and fix + 0) 
both exist and are unequal, then the derivative fix) does not exist. 

Example Consider the differentiability of the function /(x) = |x — 1| 
at the point where x = 1. 

Solution For the graph of y = |x — 1| see Fig. 2.2. 



Fig. 2.2 


The function is continuous for all values of x including x = 1 but 
/'(1 — 0) = —1 and /'(1 + 0) = 1; i.e. the derivative at x = 1 does 
not exist and the function is continuous but not differentiable at x = 1. 

2.1.5 Higher derivatives 

If fix) exists then it is a function of x which may or may not be 
differentiated. If/'(x) has a derivative it is denoted by f"{x) or d 2 y/dx 2 
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or D 2 y and it is the second order derivative of y = f(x). Similarly third, 
fourth, up to «th order derivatives may be defined: 

If y = /(x), then 


Dy 




^ s(i)- 


oy ™ % =/“’« 


* 

dx \dx n V 


2.2 STANDARD DERIVATIVES AND RULES 


2.2.1 The derivative of a constant 

Let/(x) = c where c is a constant, then f{x + h) — c 

/(x + A) -/(*) = 0. 


Hence 


f'(x) = Lira 

ft-»0 


/(X + h) -f(x) 
h 


0 , 


and 


i.e. the derivative of a constant is zero. 


2.2.2 The derivative of x n 

Let/(x) = x" where n is any positive integer, then 

f(x + h) = (x + h) n = x n + »x n-1 A + h 2 (terms in x and h). 

If h =£ 0, {/(x + h) - f(x)}jh = nx"- 1 + h (terms in x and h). Now 
let h -* 0 , then f'(x) — nx B_1 . 


2.2.3 The derivative of sin x, cos x 
d 

(0 J x (sin x) = cos x. 

From the definition, 

sin (x + h) — sin x 2 cos (x + \h) sin \h 
h = h 

= cos (x + \h) ( 6 ) 

Since cos (x + \h) -*■ cos x as h -*■ 0 and since 


sin \h 

~W 


1 as h -»■ 0 
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then the limit as h -> 0 of equation ( 6 ), or the derivative of sin a, is 
cos x. 
d 

(ii) ~jr (cos a) = — sin x. 

From the definition, 

cos (a + h) — cos x —2 sin (a + \h) sin \h 
h - h 

. , sin \h 

= - sin (a + \h) —g- 

— sin a . 1 as h -*■ 0 . 


2.2.4 The derivative of cf( a) 

If /(a) is a differentiable function of a and c is a constant, then 

{</(*)} - cf( a ). 

Let <f>(x) = c/(a), then for h ^ 0 

<t>(x + h) - <f>(x) _ cf(x + h) - c/(a) 
h h 

| + h) -/(A) j 

Let h -*■ 0, then </>'(a) = c/'(a). 

2.2.5 Differentiation of sums 

The derivative of the sum of a finite number of differentiable functions 
of a is equal to the sum of their derivatives, and more generally if /(a) 
and g(x) are differentiable and a and b are constants, 

^ W(x) + *?(*)} = af\x) + bg'(x). 

This is easily proved using section 2.2.4 and the definition of a deriva¬ 
tive, and can be extended to a linear function of any number of differenti¬ 
able functions. 

Example /(a) = a 3 + 7a 2 — 5a + 4 

nx) ” 5 1M + 7 S 1M - 5 Tx w + 7i < 4) 


= 3a 2 + 14a - 5. 
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2.2.6 Differentiation of products 

The derivative of the product of two differentiable functions f{x) and 
g(x) is f'(x)g(x) +/(*)#'(*)• For let <K*) =f(P c )g(x ) and h =£ 0 then, 

j Hx + h) - <f>(x) 
h 

fix + h) g(x + h) -f(x)g(x) 
h 

_ f{x + h) g(x + h) - fix + h) g(x) + fix + h) g(x) -/(A) gjx) 

+ *> P±M] 

Since /(x) and g{x) are differentiable and hence continuous, as h -> 0, 
it follows that 

<i>'(x)=f(x)g’(x) + g(x)f'(x). 

2.2.7 The derivative of the reciprocal l/g(*) 

The derivative of the reciprocal of g(x) is —g'(x)/{g(x )} 2 at a point 
where g(x) 0 , for, let <f>(x) = l/g(x), h ^ 0 , then 
4>(x + h) — i(x) _ i r i _ j_-I 

h h |_g(x + /i) g(x)J 

_ ~[g(x+ *)-*(*)] 

*£(* + %(*) 

_ -1 [ g(x + *) - g(*) l 

“ g(x + h)g(x) L h J- 

The proofs of the formulae in sections 2.2.6 and 2.2.7 depend on the 
assumption that the limit of a product (or quotient) is the product (or 
quotient) of the limits. 

Example Show that the derivative of x n is nx n ~ 1 when n is a negative 
integer. 

Solution Let n = —m where m is a positive integer, then 



Applying the above result and using section 2.2.2, gives 
dy —mx m ~ x 
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2.2.8 Differentiation of a quotient 

The derivative of the quotient of two differentiable functions /(x) and 
g(x) follows by applying the formulae in sections 2 . 2.6 and 2 . 2.7 for, 
let 


4>{x) = = f(x) x - 7 — 

g(x) JK g(x) 

then at a point where g(x) ^ 0 , 

JL’(y\ _ -f(x)g'(x) 

[g(x)f 

Example (i) ^ (tan x) = sec 2 x 


since 

tan x = 


(tan x)' = 

Similarly 

(cot x)' = 

Example (ii) 

Tx^ x) = 

since 

sec x = 


(sec x)' = 

Similarly 

(cosec x)' = 


cos x cos x + sin x sin x 
(cos x ) 2 


2.2.9 Differentiation of a function of a function 

Suppose/is a function of g and g is a function of x then/is a function 
of a function of x and 

£[/fe(*)}] =f'{g(x)}g'(x) (7) 

at a point x where g(x) is differentiable and / is differentiable at g(x). 
The symbol f'{g(x)} means the derivative of the function / with respect 
to the bracket { } or g(x), at g(x). 
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Writing g(x) = t, then y = fit) and equation (7) may be written 
dy dy dt 
dx~dtdx 


( 8 ) 


For if dx ^ 0, 


and 


dt = g(x + dx) — g(x), 
dy =f(t + dt) -f{t). 


dy dy dt 
dx ~ dt dx 


if dt^O 


or 


r = ° 


if dt = 0 . 


If dt 0, as dx -*■ 0 


If (3/ = 0 but <5x^0 


dy dy dt 
dx~ dt dx 

• 7 - = 0 and -7- = 0 . 
bx dx 

A rigorous proof of this may be found in a more advanced text-book. 
It follows that the derivative of f(ax + b), a and b are constants, is 
af\ax + b). (9) 

Example (i) If y = cos (3x + 2), 

then / — ~ 3 sin (3x + 2). 

Example (ii) If f(t) is an arbitrary differentiable function and g(x) is 
such that 

y =/&(*)] = * 

then ( 8 ) becomes 

1 = (dxldt)(dtldx) 

i.e. dtjdx = ll(dx/dt), provided dx\dt ^ 0 

(see also section 2 . 6 ). 

Example (iii) If V = tan 2 x = (tan x) 2 , 

then / = 2 (tan x) sec 2 x. 


Example (iv) If y = cos 2 (sin 3x) = {cos (sin 3x)} 2 , 
then y' = 2{cos (sin 3x)}[— sin (sin 3x)] 3 cos 3x 

using equation ( 8 ) repeatedly. 


2.2.10 Parametric differentiation 

When y — f{x) and x and y are both functions of a single variable, 
say t, called a parameter, i.e. x = x(f), y = y{t), and x and y are 
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differentiable then, 

i-li® provided (io) 

This follows from (8) and Example (ii) of section 2.2.9 and is useful 
when a curve is given parametrically in the form 

x = x(t), y = y(t). 

Example Find dy/dx and d 2 y/dx 2 as functions of t when 
x = a(t — sin t), y = a{ 1 — cos t). 

Solution x — a{t — sin t) 

dx/dt = a(l — cos t) 
and y = a(l — cos t) 

dy/dt = a sin t, 

and hence, ^ 

dx dx/dt 


provided 1 — cos t + 0. 

This may be simplified using the half-angle formulae of trigonometry 


dx ~ 2 sin 2 it 

Differentiating again and using section 2.2.9, 
d 2 y d (dy\ d / t , dt 
dx 2 ~ dx (dx) ~ dt (c0t ^ dx 


important to notice the procedure for finding ^ as 
} The expression (function of t) must be calculated as 


2.2.11 Implicit functions 

These are given in the form f(x, y) = 0 or /(x, y,c) = 0 where c is 
some number which may be a parameter, that is as equations which 
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involve x and y in such a way that y is not given explicitly in terms 
of x, e.g. x 2 + y 2 = 1; x 5 + 4xy 3 — 3y 5 = 2. 

It is possible to calculate dy/dx from such an equation by implicit 
differentiation using the previous rules. 


Example (i) Find dy/dx at any point (x,y) on the curve given by 
x 5 + 4xy 3 — 3 y 5 = 2. 

Solution Differentiate both sides of this equation with respect to x, let 
D be the operator d/dx 

D(x 5 ) + D(4xy 3 ) - D(3y s ) = Z>(2) 

5x 4 + 4 {y 3 + xDy 3 } — 3D(y s ) = 0 
5x* + 4 y 3 + 4x3 y 2 Dy - 3 X 5y*Dy = 0. 


Hence at points (x,y) where 15y 4 — 12xy 2 ^ 0, 


Dy 


dy 5x 4 + 4 y 3 
dx ~ I5y* — I2xy 2 


Example (ii) Find dy/dx if y 3 — lx — 1. 


Solution 

(a) Differentiating both sides of the equation with respect to x, then 



or 


dy _2_ 

dx ~ 3 y 2 ‘ 


(b) Write x = \y 3 + J, then 

dx_3f 
dy 2 
dy _ 2 
dx^Jf' 


Thus dy/dx can be calculated as the reciprocal of dx/dy provided y is 
a differentiable function of x and x is a differentiable function of y. 


Example (iii) Show that the derivative of x n is nx n 1 when n is a rational, 
non-zero fraction. 


Solution If n is rational it is of the form p/q where p and q are integers 
and q 0. If p # 0 and p, q have no common factor then n is of the 
required form. 

Then y = x n = x p "> 

and y Q = x p 

where p and q are non-zero integers. 
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Differentiating with respect to x, 
dy 

& ±c=P x 

or — = ^;l>-l-(p/9)C4-l> 

dx \q) 



Hence ~ x* = nx*' 1 . 


2.3 REPEATED DIFFERENTIATION, LEIBNITZ’ 
THEOREM 


Consider first some examples. 

Example (i) Show that the nth derivative of sin ax is a n sin (ax + mr/2). 
Solution Let f(x) = sin ax 
f'(x) = a cos ax 

= a sin (ax + £n) 

i.e. / <n) (x) = a n sin (ax + mr/2) is true when n = 1. The method of 
induction is used for the proof. Suppose that for some fixed integer 
value of n, 

f n \x) = a n sin (ax + njr/2) 

Differentiating both sides, 

/ <n+1) (x) = a n+1 cos (ax + mr/2) 

= a n+1 sin (ax + mr/2 + w/2) 

= a n+1 sin {ax + (n + 1 )tt/2} 

i.e. the result is true when n is replaced by n + 1. Since it is true when 
n = 1 it is true for all integer values of n > 1. 

Example (ii) Find the nth derivative of sin 2x when x = 0. 

Solution D n (sin 2x) = 2“ sin (2x + mr/2) using example above. When 
x = 0, D n (sin 2x) = 2 n sin (mr/2) and this has different values accord¬ 
ing as n is even or odd. 

When n is even, write n — 2k: 
at x = 0, D 2k (sin 2x) = 2 2k sin kw = 0. 

When n is odd, write n = 2k + 1: 
at x = 0, D Sk+1 (sin 2x) = 2 Zk+1 sin (2k + 1)tt/2 

= 2 zk+1 (— l) k . 
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2.3.2 Leibnitz’ theorem 

Let/ and g be functions of x which may be differentiated n times. Then 
D{fg) =fDg + gDf 

Repeatedly applying the rule for differentiating products and using 
D w for the operator d n ldx n , 

D 2 (fg ) =fD 2 g + 2 DfDg + gD 2 f 
D'Vg) =fD 3 g + 3 DfD 2 g + 3 D 2 fDg + gDJ 
D*(fg) =f^g + 4 DfD 3 g + 6 D 2 fD 2 g + 4 DjDg + gD% 
These coefficients are binomial coefficients and the result 

D w (fg) =f» <n) g +■ (j) DfD^g + ...+ (”) D«fD«-"g 

+ ...+gD*f (11) 

for any positive integer n is implied and may be proved by induction. 
It is called Leibnitz' theorem. 

Example Find D {n) {; c 2 sin x}. 

Solution 

D w {x 2 sin x} = x 2 D ln) (sin x) + nD(x 2 )D n ~ 1 (sin x) 

+ in(n - l)D 2 (x 2 )D n ~ 2 (sin x) 

= x 2 sin (x + \mr) + nix sin {* + 1(« — l)rr} 

+ \n{n — 1)2 sin {x + £(n — 2}n}, 

since D (n) (x 2 ) = 0 when n > 2 and Z) (B) (sin x) = sin (x + |mr) from 
example (i) of section 2.3.1. 


2.4 THE INCREMENT OF A FUNCTION 

Consider an estimate of the change 6y produced in a function y = fix) 
when x changes by a small amount Sx. Let P be the point (x, y) on the 
curve y = /(x) where the function is differentiable, and let Q be the 
neighbouring point (x + dx, y + <5y) where dx =£ 0. Then the slope of 
the chord PQ is dyjdx which approaches the limiting value dyfdx as 
dx -+ 0. Hence the difference between 8y[dx and dyjdx is numerically 
small when |<5x| is small. Let this difference be denoted by e, i.e. 


8y 

dx 


dy 


dx 


( 12 ) 


Then the statement 


dy dy _ 


(13) 
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is equivalent to the statement 


^1^ 

II 

o 

Lim e = 0 

<»x— 0 

(14) 

+ e where e -> 0 as dx 0, 

(15) 

and multiplying by dx 

dy — ^ dx + edx 

(16) 


Although equation (16) was derived under the assumption that 
dx # 0, it is still true even when Ac = 0, since it is only necessary to 
define e = 0 when dx = 0. 



Consider geometrically the graph of = f(x) as sketched in Fig. 2.3. 
The first term on the right hand side of equation (16) is represented by 
the length TR, since the slope of the tangent PT is dyldx, and this is 
the change produced in y along the tangent at P. When dx is small, 
this term is usually large compared with the second term sdx. The 

difference between QR = ^ dx + sdx and TR = ^ dx is edx which 

tends to zero more rapidly than dx does when dx approaches zero. 
For example, if y = f(x) — x 2 , dyldx — 2x — f\x) and 

~dx = 2x dx. 
dx 

On the other hand the exact value of dy is given by 

dy = 2x dx + (dx) 2 — f(x + dx) — f(x). 
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Comparing this with equation (16), for this particular example, 
edx = ((3.x) 2 . When Sx is small, (<5x) 2 is smaller than 8x, and dy is 
approximately equal to 2x8x. This example may be illustrated geo¬ 
metrically as in Fig. 2.4. 



Fig. 2.4 


The original square has side of length x and hence area y = x 2 . When 
the sides are increased to x + dx the area is increased by <5 y = 
2x dx + (dx) 2 . When Sx is small compared with x most of the change 
in y is given by the two rectangular areas and only a very small part is 
given by the small square of area (<5x) 2 . 

Hence if the original square has side 2 cm say and the larger square 
side 2-01 cm, then x = 2, x + dx = 2-01, <5x = 0-01. The area of the 
original square is y = x 2 = 4 and the area of the larger square is 
y + dy = (x + <5x) 2 

= x 2 + 2xdx + ((3x) 2 
= 4 + 0 04 + 0 0001. 


The increment dy = 0-04 + 0-0001. Hence if 2x<5x = 0-04 is used as 
an approximation to dy there is an error of 0-0001 square centimetres 
which is just under \% of dy. 

Since the term (dyjdx)dx usually gives a good approximation to dy 
when dx is small compared with dy/dx, it is customary to use the 
approximation 


dy^-fdx 


or 


df—f'(x 0 )dx 


in numerical calculations. 


(17) 

(18) 
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Example Using the approximation in equation (17) determine a reason¬ 
able approximation to (2-98) 3 . 

Solution Let y = x 3 and start from the point x = 3, y = 27, 
x + dx = 2-98, y + dy = (2-98) 3 . 

Using equation (17) to find an approximate answer, with Sx = — 0 02 
at x = 3, dy — ( dy/dx)dx = 3x 2 dx 

= -0-54 

hence (2-98) 3 = y + dy === y + ( dy/dx)dx 

= 27 - 0-54 

or (2-98) 3 = 26-46. 

2.4.1 The differential 
Equation (16) may be written 

df = f'(x 0 )dx + edx. (19) 

If f\x 0 ) =£ 0, the differential df at x = x 0 is defined by 

df=f\x 0 )dx (20) 

and is represented by TR in Fig. 2.3. The differential of the function x 
dx = dx (21) 

so that equation (20) may be written 
df=f(x 0 )dx 

or dy = f'(x 0 )dx. (22) 

Thus the differential dy at x = x 0 is the amount of change in y along 
the tangent line at the point x 0 which would be produced by a change 
dx in x; dyfdx is the rate of change of y per unit change in x. 

2.5 APPLICATIONS OF DIFFERENTIATION TO CURVE 
SKETCHING 

2.5.1 The sign of f(x) 

Let y — /(x) be continuous and differentiable in the range a < x < b. 

If /(x) increases monotonically in this range, for any value of x in 
a<x<b, [f(x 4 - h) — f(x)]/h has the sign of/'(x) provided h is small 
enough, and hence f'{x) > 0 in the range. 

Similarly if fix) decreases monotonically, then f'(x) < 0 in a < x < b. 
See section 1.1.4 and Figs. 2.5(a) and 2.5(b). 

2.5.2 Concavity 

Let/'(x) and /"(x) both exist in the range a < x < b. The graph of 
y = fix) is said to be concave downwards if/'(x) decreases, i.e./"(x) < 0 
as x increases through the range. The graph of y = f(x) is concave 





(a) (b) 

Fig. 2.7 Graphs of y =f(x) concave upwards in a < x < b 


When the graph is concave downwards the tangent at each point is 
‘above’ the curve, and when the graph is concave upwards the tangent 
is ‘below’ the curve. 
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2.5.3 Maxima, minima 

A maximum value of a continuous function is one which is greater 
than the values in the immediate neighbourhood. A minimum value is 
one which is less than those in the immediate neighbourhood. Thus the 
maximum and minimum values of a function refer to local properties 
and it is possible for a maximum value to be less than a minimum 
value. See for example. Fig. 2.8 where the maximum value at B is less 



than the minimum at A. Moreover, maximum and minimum values are 
not necessarily the greatest and least values of the function in any given 
range. For example, in Fig. 2.8, the greatest and least values occur at 
the beginning and end of the range. 

A point where a maximum or minimum occurs is called a turning 
point. 


2.5.4 Conditions satisfied at a turning point 

Suppose f(x ) is differentiable in a < x < b and that there is a maximum 
or minimum value of/(x) at the point x = x 0 . Then if f'(x 0 ) =£ 0, h > 0, 
{/(*„ + /;) — f(x 0 )}/h can assume both positive and negative values in 
the neighbourhood of x 0 provided h is small enough. Hence at x = x 0 
the function is not differentiable. But f{x) is differentiable at x = x 0 
and the assumption /'(x 0 ) ^ 0 is false. Hence /'(x 0 J = 0 is a necessary 
condition for a turning point, and these are located by solving the 
equation/'(x) = 0, i.e. by finding those points on the graph ofy = /(x) 
where the tangent is parallel to the x-axis. 

This condition is not sufficient to discriminate between a maximum 
and a minimum value. If there exists x 0 such that f'(x 0 ) = 0 then 
usually /'(x) changes sign as x increases through the value x 0 . If f(x) 
changes sign from positive to negative then /(x 0 ) is a maximum value. 
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and if f'(x) changes sign from negative to positive then /(x 0 ) is a 
minimum value of the function. See Fig. 2.9. 

The use of the sign of /'(x) to the immediate left and immediate right 
of a root of /'(x) = 0 is an infallible method of distinguishing between 
maximum and minimum. 



(a) Maximum at x = x 0 (b) Minimum at x = x 0 

Fig. 2.9 


Alternatively the sign of f"(x 0 ) may be investigated in order to 
discriminate between maximum and minimum. At a maximum turning 
point the curve is concave downwards, i.e./'(x) decreases as x increases 
through the value x 0 , or f"(x 0 ) < 0. Similarly at a minimum turning 
value the graph is concave upwards, /'(*) increases as x increases 
through the value x 0 , i.e./"(x 0 ) > 0. 

2.5.5 Conditions for a point of inflexion 

Suppose f"(x 0 ) = 0, then usually f"(x) changes sign as x increases 
through x 0 . If f"(x) changes sign from positive to negative, x = x 0 is 
a transition between /"(x) > 0 (i.e. /'(x) increasing, fix) concave 
upwards, and tangent ‘below’ the curve) and /"(x) < 0 (i.e. /'(x) 
decreasing, /(x) concave downwards, and tangent ‘above’ the curve). 
Such a point of transition is called a point of inflexion and the tangent 
there which ‘crosses’ the curve is called the inflexional tangent. 

Similarly if/"(x) changes sign from negative to positive as x increases 
through x 0 , then (x 0 ,/(x 0 )) is a point where the graph ofy = /(x) changes 
from being concave downwards to concave upwards and there is a 
point of inflexion at x = x 0 . 

2.5.6 Summary 

Let y = /(x), then 

(i) Conditions for a turning point at x = x 0 are /'(x 0 ) = 0 and /'(x) 
changes sign as x increases through x„. 
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(a) f(x) changes from positive to negative (or f"(x 0 ) < 0) gives a 
maximum at x = x 0 ; 

(b) fix) changes from negative to positive (or /"(x 0 ) > 0) gives a 
minimum at x = x 0 . 

(ii) Conditions for a point of inflexion at x = x 0 are /"(x 0 ) = 0 and 
fix) changes sign as x increases through x 0 . If fix 0 ) = 0 and fix) 
does not change sign as x increases through x 0 , it is necessary to con¬ 
sider the sign of fix) to the left and to the right of x„. 

Points at which/'(x) = 0 are called stationary points, which may or 
may not be turning points. 

Example (i) Investigate the stationary points of f(x) = x 3 . 

Solution f(x) = x 3 , fix) = 3x 2 , fix) = 6x. Now, fix) — 0 when 
x = 0, fix) > 0 for all values of x so that there is no change of sign 
of fix) and hence neither maximum nor minimum. 

Also fix) = 0 when x = 0, and fix) changes sign from negative 
to positive as x increases through the value x = 0. Hence the stationary 
point (0, 0) is a point of inflexion, and the inflexional tangent is the 
line y = 0. See Fig. 1.8 for the graph of y = x 3 . 

Example (ii) Investigate the stationary points of/(x) = x i . 

Solution fx) = x 4 , fix) = 4x 3 , fix) = 12x 2 . Now fix) = 0 when 
x = 0 and /"(0) = 0. But when x = 0 there is a minimum value of 
fix), since f\x) changes sign from negative to positive as x increases 
through the value zero. 

Example (iii) Investigate the inflexion point on the curve y = sin x, 
-iTT < X < in. 

Solutiony = sin x,/ = cos x,y" = —sin x. Now/ = 0 when cos x = 
0 which has no solution in the given range. Also / = 0 when sin x = 0 
and x = 0 is the only value in the range. Moreover y" changes sign 
from positive to negative as x increases through the value x = 0. Hence 
(0, 0) is a point of inflexion. The slope of the inflexional tangent is 1, 
since/(0) = 1, and its equation is y = x. See Fig. 1.17(a). 

2.6 DIFFERENTIATION OF INVERSE FUNCTIONS 

Let y = fix) be differentiable and monotonically increasing in 
a < x < b. Then the inverse function x = <f>iy) can be shown to be 
differentiable for all value of y between fa) and f(b), provided that 
fix) # 0. 

In Fig. 2.10, letP be the point (x^/x,)), then x, = fy^ and P can 
be considered as the point ($/), j x ) on the graph of x = <j>(y). In effect 
the same graph holds for both functions. If fix) 0, the tangent at P 
to y =/(x) is not parallel to the x-axis, which also means that the 



DIFFERENTIATION 39 

tangent at P to x = </>(y) is not perpendicular to the y-axis, and <j>(y) 
is differentiable at P. 

Let tan ip be the gradient of the tangent at P so that /'(*i) = tan ip. 
Let 6 be the angle made by the tangent at P to the y-axis so that 
= tan 6. Since 0 = \-n — ip, tan 0 = 1/tan ip, or 


dy 1 
dx ~ (dxjdy) 



2.6.1 Differentiation of the inverse circular functions 

(i) The inverse function y = sin" 1 x means 

x = siny and —far <y < {n 

and hence, 

dx 

T y =co% y 

= ±V0 — sin 2 /) 

= ±va-* 2 ). 

In the range — Jtt < y < \n, cos y is positive so that 

dy _ 1 

dx VO — x 2 ) 

or 1™"* = vcrh?)'W <L 

(ii) The inverse function y — cos -1 x means 

x = cos y and 0 < y < tt. 
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and — = — sin y 

— TV(1 — cos 2 j) 

= T V (1 - * 2 )- 

In the range 0 < y < it, sin y is positive, dxldy is negative and 

^ cos - lx = _-_L_ W<1 

(iii) The inverse function y = tan -1 a means 

x = tan j and — Jtt < \n, 

i dx . 

and = sec 2 y 

= 1 + tan 2 y 
= 1 + x 2 
d , 1 

and ~r tan -1 x = 7 ——y 

dx 1 + x 2 

Example (i) Show that arccos x + arcsin x = \n. 

Solution Let /(x) = arccos x + arcsin x, 

the " t-M - + vo~b?)= °- 

Hence /(x) = c for all values of x, where c is some constant. Writing 
x = 0 , 

/(0) = ^ + 0 = C, 

i.e. arccos x + arcsin x = \n. 

Example (ii) Differentiate (a) arcsin x 2 , (b) arcsin (cos x). 

Solution 

(a) £ {arcsin**} - X 2 * 

2 x 

- V(1 - * 4 ) 

d d 

(b) — {arcsin (cos x)} = ^ (arcsin /} where t = cos x, 

d . . . dt 

= ^ {arcsin 1 } X 

1 . dt 

~ V(1 - / 2 ) * dx 
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—sin x 

V(1 — cos 2 x) 


—sinx 
|sinx| 


if sin x > 0 
if sin x < 0. 


2.7 THE PARTIAL DERIVATIVE 

The derivative of a function of two variables with respect to either is 
called the partial derivative. Suppose co = f(x, y) is a function of the 
two independent variables x and y defined over the region R in the 
(• xy ) plane and consider the expression 


/(*o + h,y 0 ) -f(x 0 ,y 0 ) 
h 


If this tends to a finite limit as h -> 0, its value is denoted by f x {x 0 , y 0 ). 
It may be called the rate of change of/(x, y) with respect to x when y 
is held constant at the point (x 0 , j 0 ) in R. The function f x {x, y) is called 
the partial derivative of f(x, y) with respect to the variable x, i.e. 




when this limit exists. Other notations are {i£\ or (—^ to 

, . . . 8f \P X ) y constant \P X ) v 

emphasize y is fixed, —, and /i(x, y) this last meaning the partial 

derivative of/with respect to the first variable mentioned. If no am¬ 
biguity can arise, the notation dto/dx where a> =/(x,y) is also used. 
Similarly 


- 1 - 

is the partial derivative of/ with respect to y and may be regarded as 
the rate of change of/with respect to y when x is considered to be held 
fixed. Then the ordinary rules of differentiation apply. 

Example (i) ~ sin (2x 2 - y) = 4x cos (2x 2 - y) 

d 

T y SIn (2x2 ~ y> * - cos ( 2 * 2 - y)- 
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Example (ii) x = r cos 0, r and 0 being independent variables 
^ = cos 0 (0 considered constant), 

~ — —r sin 0 (r considered constant). 


Suppose also that y — r sin 0, then 

dy . dy 

— = sm 0, - Q = r cos0. 

In this example, r = V(* 2 + J 2 ) (x, j independent) so that 

= // —sc (V considered constant) 

dx \/(x + y) 


Sx 1 

1,e - dr * (dr/dx) 

In calculating dx/dr, x is a function of the independent variables r and 0 
and 0 is considered as held fixed; in calculating dr/dx, r is a function 
of the independent variables x and y and y is considered fixed. Thus 
( dx/dr) g and ( dr/dx) v are not necessarily reciprocals. 

Note however, that from x = r cos 0 if 0 is held fixed, 

(!), = “» 9 and (D.-SST8 

so that 

(-) = —• 

\dr/ e (dr/dx)e 


2.7.1 Higher derivatives 

Since f x and f y may themselves be functions of x and y they may under 
certain circumstances be differentiated partially with respect to x and y. 

If exists as a limit it is denoted by 

/.>(*,» O' or f„(x,y) 


or M*-# or 


Similarly 
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Also 

8 (8f\ 8 2 f 

8-y\Tx) = dyiTx or M*>y) or M*.y) 

8 (8f\ 8 2 f 

8~x\Vy) = 8^8y ° T ° r 

These last two are often called ‘mixed’ derivatives. 

2.7.2 Extension to functions of three or more variables 

The definition of the partial derivative of a function of two variables is 
extended to a function of three or more variables. For, let V(x, y, z) 
be a function of the independent variables x, y, and z, then 
'8V T . V(x + h, y, z) - V(x, y, z) 

8x ~ h 

i.e. V is differentiated partially with respect to x; y, and z being con¬ 
sidered constant. This may be written 



Similarly if V(x ly x 2 , x 3 ,. . ., x n ) is a function of the independent 
variables x ly x 2 , x 3 . x n , 

— = Lim V ^ Xl + h '* 2 ’ X3 ’' ' •’ Xn ^ ~ V ( Xl ’ * 3 ’ • • •’ *") 

& X 1 h-+Q h 

i.e. V is differentiated partially with respect to x u the other n — 1 
variables being held constant. 

Example (i) f(x, y) = x 2 sin (2x 2 — y) 

8f 

7^ = 2x sin (2x 2 — y) + xHx cos (2x 2 — y) 

8f 

Q- y = -^ 2 cos(2 x 2 -y) 

8 2 f 8 /8f\ 

d^ = fa[frc) =2sin ( 2x2 -y) + 2x4x cos ( 2x * - y) 

+ 12x 2 cos (2.v 2 — y) — 4x 3 4x sin (2x 2 — y) 

— (2 — 16x 4 ) sin (2x 2 —y) + 20x 2 cos (2 jc 2 — y) 

$-*($)--****-» 

8 2 f 8 I8f\ 

ty8~* = sy\8x) = ~ 2x cos (2x2 ~ y ) + 4x3 sin (2 * 2 - y ) 

& y = Tx (|) = ~ 2x cos (2 * 2 - y> + 4x3 sin ( 2x2 - y)- 
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In this example the mixed derivatives f yx and f xy are equal, which 
suggests that the operations d/dx and d/dy are commutative. This is not 
always true, but it is possible to prove that when f yz and f xy are both 
continuous they are equal. 


Example (ii) If z is a function of x and y determined by the equation 
z 3 — 3 yz — 3x = 0, 

show that 

& dz . _ [ d 2 z , (8z\n 8 2 z 

(a) z T x ~y “ d 

(Assume z 2 — y ^ 0.) 

Solution 


‘ df 


z 3 - 3/z - 3x = 0 
(a) Differentiate (i) partially with respect to x to give 

3 ^- 3 ^~ 3 = ° 


8z 1 
8x ~ z 2 - y 


and with respect to y to give 


3 ^- 3 ^" 3z = ° 


hence 


8z _ z 
z 2 — y 


dz _ dz 
2 dx~ dy 


(ii) 


(b) Differentiate equation (ii) partially with respect to y 

dz dz d 2 z d 2 z 
dy dx Z dy dx ~ dy 2 

substitute for dz/dy from equation (ii) and 

Idz\ 2 d 2 z _ d 2 z 

Z \8xJ + Z dydx^ df 

which gives the required result when 8 2 z/8y dx is replaced by 8 2 z/8x dy. 

Notice that ordinary rules of differentiation of constants, of products, 
and of function of a function are used. 
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EXERCISE 2 

1 Find /'(l),/'(0) for the function f(x) — x 2 + 1/(1 + x 2 ). 

2 Find /'(0), f'd 77 ) for the function sin 2x + cos 3 x. 

3 Differentiate 

(i) y * tan jc ’ ® V(1 + sin 2 x ); (iii) sin 2 x cot 2x. 

4 Show that the gradient of the curve y = x 3 + 6x 2 + 15x + 28 is 
positive for all values of x. Show that the curve has a point of 
inflexion when x = —2 and state the gradient of the curve at this 
point. Sketch the graph of y = x 3 + 6x 2 + 15x + 28. 

5 Find dy/dx as a function of t if x = t + sin t and y = 1 — cos t. 
Show that d 2 y/dx 2 = 1/(1 + cos t) 2 . 

6 Find dy/dx in terms of t when 

1 - t l- It 

*=r+i? 

and show that )’. 

7 Find dy/dx at the point (2, 1) on the curve given by 

x 3 - 3 xy + y 3 = 3. 

8 Find the equation of the tangent at the point (1, 0) on the curve 

3x 3 + 2xy 2 = 2y + 3. 

9 Find the coordinates of the points on the curve x 2 — xy + y 2 = 12 
at which the gradient is zero. 

10 Show that the nth derivative of cos x is cos (jc + £mr), and find the 
nth derivative of (x 3 + 1) cos 3x when x = 0. 

11 Find the coordinates of the turning points on the curve 
y = x + 1—2 cos x in the range 0 < x < 2ir, distinguishing 
between the maximum and the minimum. Show that the tangent to 
the curve where x = 0 is also the tangent at the point where x = 27r. 
Draw a sketch of the curve in the range 0 < x < 27 t. 

12 Find the values of f x , f y , f XI , and and verify that f xy = f yx for 
the following functions. 

(i ) x/y; (ii) (x 2 + y 2 ) -1 ; (iii) x cos y + y cos x. 

In (i) and (ii) assume x ^ 0, y ^ 0. 

13 Show that, if u = l/r n and r 2 = x 2 + y 2 + z 2 then 

d 2 u 8 2 u d 2 u n(n — 1) 
dx 2 + df + d?~ r n+2 ’ 


^0. 
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14 If u and v are functions of the independent variables x and y which 
satisfy the equations u 2 — v 2 + 2x + 3y = 0 and uv + x — y = 0 
and m / 0, u 5 ^ 0, find du/dx, 8uj8y, dv/dx and dv/dy in terms of u 
and v. 

15 Show that if V(x,y, z) = x/(x 2 + f + z 2 ) 3 ' 2 

, 8V 8V 8V , , , 

then x -+y-+z Tz = - 2V, x 2 + y 2 + z 2 ^ 0. 

16 If F = x n f(y) where n is a constant and 

8 2 V 28V 1 8 2 V 

8x 2+ x8x + x 2 8y 2 “ ° 

show that 


^ + nin + l)f(y) = 0, (x # 0). 



3 

Infinite Series and Convergence 


3.1 LIMIT OF A SEQUENCE 

Consider an infinite sequence of numbers denoted by {u n } or a 1 , a 2 , a 3 , 
. . ., a n ,. . . where n is a positive integer. The value of any term depends 
on its position in the sequence, and there is no last term. 

Example (i) 0, f, f, . . ., 1 n > 1. 

Example (ii) 1, 1 £, If, . . ., 2 - 1/2",. . ., n > 0. 

Example (iii) 1, f,. . 1/2",. . ., n > 0. 

Example (iv) y, y 2 , y 3 . y n ,. . ., n > 1. 

Example (v) 1, —2, 3, —4, . . ., (—l)" +1 n, . . ., u > 1. 


i 


0| i 2 3 4 


Fig. 3.1 

Graphically each of these examples may be represented as points on 
a straight line, or as functions of n by a ‘graph’ which is a set of isolated 
points defined for integer values of n only. Figure 3.1 shows the ‘graph’ 
of {y n } when y = — 1 and Fig. 3.2 shows the graph of the sequence 
{1-1/"}. ' ' 
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A sequence approaches a limit / if when any positive number, say h, 
however small, is chosen, then eventually all the terms of the sequence 
lie in the range / — h to / + h, i.e. a n -»■ / as n -»■ oo if it is possible to 
choose h so that all the terms of the sequence after the Mh, lie in the 
range / — h to / + h, where N depends on the value of h chosen. Such 
a sequence is illustrated in Fig. 3.3. 



"ol N n 

Fig. 3.3 

For example, consider the sequence a n = 2 — 1/2", choose h — 1/10 3 . 
All the terms of the sequence after the tenth are within the range 
2 — 1/10 3 to 2 + 1/10 3 and hence the limit is 2. 

In the examples above, sequence (i) has limit 1, sequence (ii) has 
limit 2, sequence (iii) has limit 0. The behaviour of sequence (iv) 
depends on the value of y; it will be shown in section 3.3.1 that the 
limit is 0 if — 1 < y < 1, but that if y is outside this range there is no 
limit. Sequence (v) has no limit. 

A sequence of positive terms {a„} is said to tend to infinity if for 
every choice of a number A, however large, a n is greater than A for all 
large values of n. This is written a„ —>- oo as n -*■ oo. Similarly a sequence 
of negative terms a n -*■ — oo as n oo if it is possible to choose a 
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number, say —B where B is any positive number, such that all the 
terms of the sequence beyond a certain point are less than — B. A 
sequence which does not tend to a limit or to ±00 is an oscillating 
sequence. For example, if a n = y n and y = — 1, the sequence —1, 1, 
— 1, 1, — 1, . . . is an oscillating sequence. 

A sequence is mono tonic increasing if a n + 1 > a n for all values of n. 
Similarly a sequence is monotonic decreasing if a n+1 < a„ for all 
values of n. 

A sequence is bounded above if there is some number C, such that 
a n < C for all values of n, and it is bounded below if there is some 
number E, such that a n > £ for all n. Sequence (i), 

0, h f__ 1 - 1 In, . . . 

is monotonic increasing and is bounded below by 0 and above by 1. 
Sequence (v), {(— l) n+1 «} is bounded neither above nor below. 

The most fundamental theorem on monotonic sequences is the 
following, which is stated without proof. An increasing sequence which 
is bounded above tends to a limit; a decreasing sequence bounded 
below tends to a limit. An increasing sequence which is not bounded 
above tends to infinity as n tends to infinity, a decreasing sequence 
which is not bounded below tends to minus infinity as n tends to infinity. 

Suppose l and /' are the limits of the sequences {tf„}and {b n } respectively. 
Then a n = / + k n and b n = l' + k' n where n depends on the values of 
k n , kn chosen. It is not difficult to show that 11 + fil’ is the limit of the 
sequence { la n + ub n }, where 1, ju are any constants. For it is possible 
to choose a positive number h, however small, so that for sufficiently 
large n, \lk n \ < \h and \fik' n \ < \h, and hence 

| la n + pb n — (11 + fil ')| = | lk n + fjJc'nl < h. 
that is, la n + f.ib n -*-11 + (iV as n->- co 

In particular, if 1 = 1, fi — — 1 then a n — £>„—>• l — /' that is, the 
limit of the sequence formed from the difference of two sequences is 
the difference of the limits. 



Hence the sequence is monotonic increasing. Also a n < 1 for all n 
and so the sequence is bounded above and tends to a limit as n ->- co. 
{Moreover lim a n = 1. 
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Example (ii) Examine a n = 1/2". 

Solution Here, a n+1 < a n , since l/2 tt+1 < 1/2", all n, and a n > 0 for 
all n. That is, the sequence {a„} is decreasing, bounded below, and 
hence has a limit. The limit is zero. 

Example (iii) Examine a n = n. 

Solution Here a n+1 >a n , the sequence is mono tonic increasing 
bounded below by zero and is not bounded above. Hence a n -*■ oo as 
n — > go. 


Example (iv) Examine the sequence {a n } = {(1 + l/«) n }. 

Solution The general term is 

Consider any value p> n, then 

Now a p > a n , since a p consists of more terms than a„ which are all 
positive, and in those which correspond, those in a v are the greater, for 
example: 

o(‘-j)>rM-2) 

/ . 1 1 . 1 . 1 \ 

^since p> n,-<-,\ — - >1 — -y 

Therefore {a„} is an increasing sequence. Moreover the sequence is 
bounded above since 


1 1 a 

a„<l + l + 12 + 12 3 + . . .+ 
<1 + 1 + 2 + 22 + ... + 2^1 
= 1 + = 1 + 2 (1 - Q) B ) 

< 3 for all values of n. 


1 

1.2.3 . . .n 
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Hence a n approaches a limit since it is an increasing sequence bounded 
above. This limit is denoted by the constant e. 


3.2 CONVERGENCE OF SERIES 


3.2.1 Sum of a series n 

A series is the sum of terms of a sequence. The series % a n having a 
finite number of terms is called a finite series and has a finite sum. The 
series 2 a n having an infinite number of terms is called an infinite series 

and may or may not have a finite ‘sum’. 

The sum of the first n terms of a series is denoted by s n and is called 
the nth partial sum. Thus = a x , s 2 = «i + a 2 , s 3 = a x + a 2 + a 3 , 
. . ., s n = a x + a 2 + a 3 + . . . + a n . For example, consider the 
infinite series 


1 + 2 + 4 + 8 + > • - + 2"-i + - • •’ 
= 1 , *-!*,*« 1 *,. • = 


These partial sums themselves form a new sequence which has the 
limit S = 2 as n -+ oo, and this limit of partial sums is called the ‘sum 
to infinity’, or just the sum of the infinite series. This is written 


1 + £ + (£) 2 + ••• + (£)" + ••• = 2 . 


The ‘equals’ sign is used here to mean the limit of the partial sums. It 
does not mean that a large number of terms of the series may be added 
to give the sum 2 , for there is no value of n such that the sum of n 
terms is 2 , nor is 2 a member of the series. 

When the sequence of partial sums {.s , n } tends to a limit S as n -*■ co, 
the series is said to be convergent, or to converge and the limit S is 
called the sum of the series. When the sequence of partial sums does not 
converge, the series is divergent, that is if s n -» oo or s n -*■ — co or if 
s n oscillates. 


3.2.2 Conditional and absolute convergence 
When the series 

a i + a s + u 3 + . . . + a n + . . . (1) 

consists of some positive and some negative terms it may be conditionally 
convergent or absolutely convergent. 

If the series 

M + M + \a 3 \ + . . . + Kl + • • • (2) 

converges then, as shown below, so also does the series ( 1 ), and ( 1 ) 
is then said to be absolutely convergent. 
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If series (1) converges and series (2) does not converge, then (1) is 
said to be conditionally convergent. For example the series 

l-i + i-h + . ■ . 

is conditionally convergent, since it converges (see section 3 . 5 . 3 ), yet 
the series 

l + * + i + i + . • ■ 

may be shown to be divergent (see section 3.3.2). The series 

1 _ 1 1 1 

1 ~ 2 2 + 3 2 ~ 4 2 + ' ' ' 

is absolutely convergent since the series 

1 1 1 1 

1 + 2 a "b 33 "I" 42 "b • • • 

may be shown to be convergent. See example (iii) of section 3.5.1. 

To show that if (2) converges then so does (1), write u n = \{\a n \ + a n ) 
and v n = i(\a n \ — a n ). When a n > 0 ,u n = a n and v n = 0; when a n < 0, 
u n = 0 and v n — —a n , so that whatever the sign of a n , u n > 0 

and v n > 0. Also |a n | = «„ + v„, a n = u n — v n , u n < |«r B |, and 2 \ a *\ 
is convergent. 1 

Suppose J |o»| = A, then ^u r < A for all values of n. Hence the 
partial sums j ,u r tend to a limit, i.e. 2 M n is convergent. Similarly ^v n 
is convergent. Using the result that the limit of a sequence {«„ — v n } 

is the difference of the limits, leads to the result that = £(«„ — v n ) 
is convergent. 1 1 

3.3 SOME STANDARD INFINITE SERIES 
3.3.1 The geometric series 

l + y + y 2 + . . ■ +y n + . . . = f, y n . 

The nth partial sum, s n is given by 

= 1 +y + y 2 + . . . +T n_l 


L nl 

l-y 
1 1 



When — 1 <y < 1, then j n ->0 as n-* 00 and 1/(1 — y), so 
that, by the definition of convergence of series in section 3.2.1 the 

geometric series is convergent and has the sum 1/(1 — y). 
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If y < — 1 or y > 1, y n does not tend to a limit as n -*■ oo and hence 
neither does s n . 

If y = 1 the series is 1 + 1 + 1 + . . and s n = n -* oo with n. 
Hence, if / < — 1 or / > 1 the geometric series 2 y n i s divergent, 
if — 1 < y < 1 it is convergent. 

3.3.2 The harmonic series 
The series 

1 + i +J + i + - • • = I !/« 

is sometimes called the harmonic series. This is divergent, for although 
a n = 1 In -*■ 0 as n ->■ oo, it is possible to prove s n increases beyond all 
limit. The sum of the first four terms 

l+*+i+i>l+*+i+i=l+i+f 

The sum of the next four terms 

i + J+ i + i>i + i + i + i= 8 = i- 

The sum of the next eight terms is > 8/16 = \ and so on. Hence 
^ n >l+i + i + i + . . • 

to k terms where n = 2 fc_1 and increases beyond all limit as n -► oo. 
Hence the series is divergent. 
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3.4 SOME RULES 
3.4.1 Rule 1 

Let 4- 02 + o 3 + . . . (3) 

be convergent and its sum be 5. Then L i m a n = 0 since a n = s n — s n . x 
and s n and j„_i have the same limit S as n -*■ oo, 

Lim a n = S — S = 0. 


This property must not be taken as a test of convergence. If a series 
converges it is necessary that the nth term tends to zero, and if the nth 
term does not tend to zero there is no need to test further for con¬ 
vergence. That is, the converse of this rule is a test of divergence. The 

series 2 a n is divergent if a n does not tend to zero as n tends to infinity. 
For example, the series ^ was shown to be divergent in section 

n=l 

3.3.2, yet a„ = 1/n 0 as n -> oo. The series 


diverges since 


‘ n + 1 1 + 1/n 


3.4.2 Rule 2 

Consider the remainder R m , after m terms of the convergent series (3). 
This is defined to be the remainder when the first m terms are removed 
and is 

r m = 2 + a m+ 3 + . . . (4) 

The series (4) converges for all integer values of m and its sum is 
S — s m . This means that a finite number of terms at the beginning of 
a series may be removed without altering the convergence of the 
series. The proof depends on the fact that the nth partial sum of the 
series (4), plus the terms removed, is a partial sum of the original 
series (3), which has the limit S. Hence the limit of the nth partial sum 
of (4) is S — s m , and hence (4) is convergent. 

Similarly if a finite number of terms is added to the convergent series 
(3) the resulting series is convergent and its sum is S plus the sum of 
the terms added. 

As a corollary, if 2 a n is convergent, 

Lim R m — Lim (a m+x + a m+2 + a m+3 + ...) = 0. 

since R m = S — s m which -> 0 as m -> oo. 
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3.5 TESTS FOR CONVERGENCE 

In order to prove that a series converges it is sometimes possible to 
find its sum. This is not always possible and tests have been devised to 
discover whether a given series is convergent or not. 

3.5.1 The comparison test 

Let 2 a n and 2 b n be two series of positive terms. If the second series 
is convergent and a n < cb n for all values of n where c is any constant, 
then the first series is also convergent. For if 

+ b 2 + b 3 + . . . = B, 

then + 62 "b ^3 " 1 " • • • b n ^ B for all n, 

and a x + a 2 + a 3 + . . . + a n «£ cB. 

Hence the partial sums of the first series tend to a limit and 2 a n is 
convergent. 

If the series 2 b n is divergent and a n > cb n for all values of n and c 
is a constant, then 2 a n also diverges. 

Example (i) Prove 2 1/V« is divergent. 

Solution Write a n = l/V n > choose b n = 1 /«, then since \/n < n for 

all values of n, - 4 - > -> or a n > b n for all n. 2 b n is known to be 
y« n 

divergent and hence 2 a n is also divergent. 

Example (ii) Prove that 2 jjj is divergent when p < 1. 

Solution Let a n = l/n p , p < 1 and choose b n = l/n, then since n v < n 
for all values of n when p < 1, a„ > b„ for all values of n. But 2 b n 
is divergent and hence 2 fl » is divergent, i.e. 2 -7 is divergent when 
v<\. n 

Example (iii) Prove that 2 l /« 2 is convergent. 

Solution Write a n = - —= and choose b n = —r~r;- Since 
(n + l) 2 n(n + 1) 

(n + l ) 2 > n{n + 1 ) for all values of n, then ^ ^ ^ for 
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all values of n. It is known that T -7 is convergent (see section 

j n=l n \ n + 1) 

3.3.3), hence 2 ^ qp is convergent. Now the given series can be 
written 


Hence, using Rule 2 of section 3.4.2, 2 1/n 2 is also convergent. 


Example (iv) Prove that the series 2 l/n p is convergent if p > 2. 
Solution Let a n = l/n p , p ^ 2, and choose = 1/n 2 . Since 
n p > n s , /? > 2, 

then -4 < -4, P > 2 and all «. 

n p n 2 ^ 

Hence, since 2 l/” 2 is convergent, so also is 2 l/n p when p > 2. 

It is also possible to prove that 2 1 /«” is convergent when 1 < p < 2, 
using a test for convergence which is not included in this book. The 

series 2 1 /« p , which is convergent for p > 1 and divergent for p < 1, 

is used as a standard series in the comparison test. The choice of 
standard series in this test is made from common sense and experience. 
For example, if a n = (2n — 1 )/(n 4 + 1), a n is of the order of 1/n 3 , 

since when n is large, a n — 2n/n 4 = 2/n 3 , and the series 2 a n is compared 
with the convergent series 2 1/n 3 , as follows: 

_ 2n - 1 _ 2 /n 3 - 1/n 4 _2 _ 2 x - 
— n 4 + 1 — 1 + 1/n 4 < n 3 — X n 3 

2 1/n 3 converges, and hence so also does 2 

3.5.2 The ratio test 

If j-^J -► r < 1 as n —oo then 2 a n is absolutely convergent and 

hence convergent. If r > 1 the series diverges; if r = 1 the test fails. 

For suppose r < 1 then it is possible to choose h such that all the 
terms of the sequence \a n+1 /a n \ after the Mh lie in the range r — h to 
r + h. Choose h = L(1 — r), then 

r + h = HI + r) < 1 

and pw+i <r + h, pLt? < r + h etc. 

I o N | |«w+i| 
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or |a w + 1 | < (r + h)\a N \, |«w+a| < (r + *)K+i| <(r + h) 2 \a N \ etc. 

and 2 l fl nl = l fl *l + + \ a N+z\ + - • • 

< 1 ^at |{1 + (r + h) + (r + h) 2 + . . .}. 

Hence 2 l fl n| is convergent by comparison with the geometric series 

(r + h < 1). Adding to this convergent series the finite number of 
terms |aj| + |a 2 | + |a s j + ... + |a w _i| and using Rule 2 (section 3.4.2) 

shows that 2 ! a n| is convergent. From section 3.2.2 it follows that 2 a n 

is absolutely convergent and hence convergent. 

If r > 1 then, for n sufficiently large, |a» +1 | > \a n \ and the magnitude 
of the terms increase with n. Hence Lim a n =£ 0, and by the converse of 

Rule 1 (section 3.4.1) the series 2 a n is divergent. 

Example (i) Prove that the series 2 « 2 /3 n is convergent. 

Solution Write a n = n 2 / 3 B , then 

1 g,+i 1 (n + l) 2 3^ 

I a n \ 3 n+1 ‘ n 2 

1 /, , 'V 1 

_-5l 1+ ;) “*5 “ 

r = \ and the series 2 a» is convergent. 

Example (ii) Prove that the series J 1/n! converges. 

Solution Write a n = 1/n!, then 


| I (» + 1)! 

as 

Hence the series 2“? !=1 + 1 + Ji + J; + - ■ • is convergent. 
Example (iii) Prove that the series 2 2"/« is divergent. 

i 

Solution Write a n = 2 n /n, then 

|a n+1 1 2 n+1 n_ 

I a n I “ n + 1 • 2" 

-TTTfn ^ 2 “ 


Hence 2 o n diverges. 
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Example (iv) Consider the convergence or divergence of the series 

f(« + 2)/(« 2 - 1). 


Solution Write a n = (n + 2)/(n 2 — 1), then 
\a n *\ I n + 3 n 2 - 1 

| a n I n 2 + 2 n n - 1-2 

(1 + 3/i.Xl ~ W) . 

(1 + 2/n)(l + 2jn) 


n-*- co. 


That is, r = 1 and the test fails. This series diverges since 
n + 2 _ 

2 - 1 " 


a n = - 


-,«> 1 


2 1/n diverges, hence 2 « n diverges. 


Example (v) Consider the convergence or divergence of the series 

i(-i) n+i /». 


Solution Write a n = (—1 ) n+1 /« 

|f*n + l I _ n 
I fln I » + 1 

= iTVn ^ 1 


n -> oo. 


That is, r = 1 and the test fails. This series converges as is proved in 
section 3.5.3 following. 

The examples (iv) and (v) emphasize the fact that when r = 1 there 
is a failure of the ratio test and that resort must be made to some 
other test. 


3.5.3 The alternating series 

Consider a series with alternating sign, i.e. a series in which the signs 
are alternatively positive and negative. Let the series be a 1 — a 2 + 

a 3 — a* -\ -. . . where a t is positive for all i. Then if 

Lim a n = 0 and «i > a 2 > a 3 . . . > a n > 0, 

the series converges. 

Let s 2n be the sum of the first 2 n terms. 

s 2n = (a x — a 2 ) + (a 3 — a 4 ) + . . . + (a 2n ^ 1 — a 2 „). 

Each bracket is positive so that s 2n increases steadily, or 
0 < s 2 < < . . .. 
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Also 

s 2 n = cti - {(a 2 - a 3 ) + (a 4 - a 5 ) + . . . + (a 2n . 1 - a 2n _ x ) + a 2n } 
< a x since all the brackets are again positive. 

Hence s 2n is monotonic increasing, bounded above and therefore tends 
to a limit. 

Also s 2n +1 = s 2n + a 2 n+ i -> s 2 „ as n -► oo 

since a 2 n+i —► 0, and thus s 2n +i converges. Hence the sequence {^ n } 
converges and so does the alternating series. For example, the series 
1 — i + i — i + • . ■ converges since a n -> 0 as n —► oo and 
a x > a 2 > a 3 > . . . > a n > 0 for all values of n. 


3.6 POWER SERIES 
A series of the form 

fl 0 + fli* + a 2 x 2 + . . . = 2 a * x " 

n=0 

is called a power series in the variable x, where a 0 , a x , a 2 , . . . are 
constants. If it converges it defines a function f(x) given by f(x) = 
a 0 + a x x + a 2 x 3 + . . . . 

The series always converges when x = 0. It may or may not converge 
for any other value of x. 

Example (i) The geometric series 1 + x + x 2 + . . . converges to 
1/(1 — x) provided |x| < 1 and diverges for all other values of x. 

Example (ii) The series 

1 + * + 5T + 5T + ’ • • 


converges for all values of x since by the ratio test of section 3.5.2 

I * a+1 »!|_ M 


| (n + 1)! ' x n | n + 1 


* 0 as n -*■ oo 


for all values of x. 


Example (iii) Consider the series 


|(» + 1)*X*| "(1 + 1/1!) 


Hence the series converges absolutely when |x| < 1 and diverges when 
|x| > 1. When x = 1 the series becomes 1 — 4 + i — i + . . . 
which converges (see section 3.5.3). When x = — 1 the series 
— 1— $ — $ — J — . . . diverges by comparison with the harmonic 
series (see section 3.3.2). 
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3.6.1 Radius of convergence 
Consider the power series 

a 0 + a ± x + a 2 x 2 + .... 


The number R, such that the power series converges absolutely when 
|x| < R and diverges when |x| > R, is called the radius of convergence 
of the series. The series may or may not converge when |x| = R. See 
example (iii) above. When the series is convergent everywhere, i.e. for 
all values of x, R is infinite. 

If f(x) = a 0 + + a 2 x 2 + . . . + a n x n + . . ., 

then f'(x) = a x + 2a 2 x + . . . + na n x n_1 + . . . 


for every value of x inside the range |x| < R where R is the radius of 
convergence of the first series. That is, it is possible to differentiate a 
power series, whose sum is f(x), term by term within the interval 
defined by its radius of convergence, and the differentiated series has 
as its sum the derivative of f(x). Both series have the same radius of 
convergence. 

Hence, a power series with radius of convergence R may be differen¬ 
tiated or integrated term by term within its radius of convergence. 

The existence of R and the statements above require proofs that are 
beyond the scope of this book. 

Example (i) Show that the series 


converges for all values of x in the interval — 1 < x < 1. 


Solution Using the ratio test, 



= _Jil_* 1*1 

(1 + l/«) 11 


n -*■ oo. 


Hence the series is convergent provided |x| < 1 and is divergent when 
|x| > 1. When x — 1, the series is 1 + | J + . . . which is diver¬ 
gent (see section 3.3.2). When x = — 1 the series is 

— 1+i — 1 + i-h. • • 


which is conditionally convergent by comparison with 
1—i + J — i-1-. . .. 

Thus the given series converges in the interval — 1 < x < 1 and 
diverges everywhere else. Its radius of convergence is R = 1. 

The result of differentiating this series term by term is 
l+ x + x 2 + . . ., 
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which is the geometric series which converges for —1 < x < 1. The 
differentiated series converges at all interior points of the original 
region of convergence but not at the end point x = — 1. 

On the other hand, integration of the geometric series term by term 
gives a series which converges at one new point, x = —1, at which 
the original (geometric) series diverged. 

Example (ii) The series 

0 ° Y« v*2 y 3 

„? 0 ;rr 1+ * + 2i + 3T+--- 


converges for all values of x. See example (ii) of section 3.6. Differenti¬ 
ating this series term by term gives 


X‘ 

1 + x + 2 

2 JC 3 

! + 3! + - • • 

which is the same series. Hence the function defined by this series is 
equal to its derivative and converges for all values of x. 

EXERCISE 3 

1 Do the following sequences have limits? If so, state these limits. 
Give reasons for your answer in each case. 

(i) 0, 1, 0, -i 0, i, 0, -i, . 

. 0, l/2 Sn , 0, — l/2 an+l , 0, . . . 

(ii) 1, i 2, J, 3, . . ., in,.. 


2 For each of the following a n determine whether the sequence {a„}, 
so defined, has a limit, and find that limit when it exists. 

... 2n + 1 

(1) rr^ 

(vi) sinn 

® W TS 


(“O' + ^r 

<viii) irB 

, 1 +(-!)" 

(iv) - 

v ' n 

, 3- + 1 
(,X) 2mTT 

(v) 1 +(-!)» 

, , 1 + 3* 

W 7^T 


3 


Find an expression for the nth partial sum of each of the following 
series. Find the sum of the series if it converges. 


2 
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(iii) 3 + 6+12 + . . . + 3 (2 B_1 ) + . . . 

4 State the value of each of the following: 

(i) r 2 (-l) r+ V; (ii)i o (”)^ r+2 

5 Discuss the convergence or divergence of the following series. 

(i) x - hx a + ix 3 - $x* + . . . + (-1 ) n+1 x n /n + . . . 

oo ( n l)2 

(ii) 2 a n where a n = ^ 

where = 2 Vw1 + 3> } ' 

6 Discuss the convergence of the series whose general terms are 
(i)(-l)"*»/r*; (ii) (Sinn*)//! 2 ; (iii) (-l)"x 2 "/(2n)!. 

7 Determine whether the following series converge. 

(i) 2 (« + V")" 1 ; (ii) 2 (iii) 2 ^ 

n=l »=1 J n=I z 

8 Prove that, if the series 2 a n converges, then Lim a n = 0, and 

show by an example that the converse is not necessarily true. 
Determine whether the following series converge or diverge 

W nli ( “ 1)B ^+1 ; (U) „?! Vin(2n + 1)}‘ 

9 Find the radius of convergence of each of the following power 
series. 

... ,£ (n !) 3 x n * x n ® x B 

10 Prove that if 2 a n converges absolutely then 2 °n sin nx converges 
for all real values of x. 

11 Prove that if a power series a 0 4- a x x + a 2 x z + . . . converges 
when x = x 0 then it converges whenever |x| < [x 0 |. For what 
values of x do the following series converge? 



Exponential, Logarithmic, and 
Hyperbolic Functions 


4.1 THE EXPONENTIAL FUNCTION 

The general exponential functions may be defined in one of several 
ways. Their fundamental property is that the derivative of the function 
is proportional to the value of the function. That is they all satisfy 
equations, called differential equations, of the type 


dy 

i =c y 

where c is a constant. 

In particular the standard exponential function is defined to be that 
function which satisfies the differential equation 

<» 

and such that y — 1 when x = 0. (2) 

In example (ii) of section 3.6.1 the function ]£ “| was shown to be a 

n=0 n! 

function which satisfies (1) and when x = 0 its value is 1. Let this 
function be called E(x), i.e. 


E(x) = l + x + ji + yj + . • 
From equations (1) and (2) 

E\x) = E(x) 

E(0 ) = 1 




(3) 


(4) 

(5) 


4.1.1 Properties of E(x) 

E(x) satisfies the functional relation 


E(a)E(jS) = E(x + /?). (6) 


To prove this consider the function F(x) defined by 

F{x) = E(c - x)E(x) (7) 

where c is some constant. Differentiating equation (7) gives 
F'(x) = —E'{c - x)E(x) + E(c - x)E'(x) 

= -E(c - x)E(x) + E(c - x)E(x) using (4) 
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Hence F(x) = constant, since its derivative is zero 

or F(x) = k say, for all values of x 

or E(c — x)E(x) = k substituting from (7). 

Now, writing x = 0, 

£(c)£(0) = k, 

or £(c) = k since £(0) = 1 from (5). 

But k — E(c — x)E(x), 

hence E(c — x)E(x) = £(c), 

and writing c — x = <x, x — fi so that c = a + /3 gives 
£(«)£(/?) = £(« + /?) 

which is equation (6). 

Replacing a by x and f) by —x in equation (6), 

£(*)£(-*) = £(0) = 1 from (5) 

*-*-m (8) 

Repeated application of (6), setting a = p = . . . = x, gives 
{£(*)}" = E(x)E(x)E(x) . . . E(x) to n factors 
= E(x + x + x + . . . + x) (n terms) 

i.e. {£(*)}” = E(nx) n = 1,2, 3. (9; 

Hence if p and q are positive integers, setting x = plq, n = q in (9) 

{E(plq)Y = E{p) = {£(l» p (10) 

hence E(p/q) = {E( 1)} P '« (11) 

and E(x) = {£(1)}* whenever x = plq. (12) 

Since a rational number is any number which can be written in the form 
pjq where p and q are integers and q =£0, equation (12) is true whenever 
a: is a positive rational number. Writing —p for p in equation (11) 
shows that equation (12) is true when x is any rational number, positive 
or negative. 


4.1.2 The number e 

This is chosen to be £(1), the value of E(x) when x — 1, i.e. 

e = £(1) = 1 + 1 + 1 + i + . . . = 

and its value may be calculated to any degree of accuracy. For example, 
if thirteen terms of the series are added, e = 2-718 281 8 correct to 
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seven decimal places. From (12) 

{£(!)}* = E{x), 
so that e 1 = E(x), 

and the properties of E(x ) developed in section 4.1.1. are written in 
terms of e 1 as follows: 

~ (e z ) = e 1 from (I), 

e I =l+x + |y + |j + . • •= 2 jjy from (3), 
e« e « = e «+^ from (6), 
e~ x = ~ from (8). 



4.1.3 The graph of = e* 

When x = 0, y — e° = 1, dy/dx = e° = 1, so that the gradient of the 
curve at (0, 1) is 1. 

When * is positive, e 1 > 1 + x so that e 1 oo as x -> oo, and x 
increases steadily with x. 
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When x is negative, write x = — t, t> 0 then e 1 = e“‘ = l/e ( is 
always positive and as x -> — oo, t -> oo, e 1 -> 0. 

Hence as x increases, e 1 increases monotonically and is always 
positive. The graph is shown in Fig. 4.1. 

When the exponent is a lengthy expression it is sometimes convenient 
to write e* as exp (x). 

4.2 THE LOGARITHMIC FUNCTION 

Since the function e z is continuous and monotonic increasing for all 
values of x it is possible to define its inverse function (see section 1.2). 
This is the logarithmic function defined by the relation y = log„ x such 
that x = e y . 



Since x = e v takes all positive values, loge x is defined for all positive 
values of x only. The subscript e, will be omitted in the subsequent text 
and the symbol log will be used to denote the logarithm to base e. (The 
abbreviation In is also used.) Any base other than e, will be specially 
designated as such. For example logi 0 x means logarithm to base 10, 
the common base used in computation. 

The graph of y = log x is derived by ‘reflecting’ the graph ofy = e 1 
in the line y = x and is shown in Fig. 4.2. The important facts to notice 
are log 1=0, since e° = 1; y is not defined for negative values of x; 
and as x —> 0, y —► — oo. 

4.3 LIMITS CONNECTED WITH e* AND log jc 

As x —*■ oo, e 1 —*• oo more rapidly than any power of x, i.e. 

— ~>oo as x->oo (13) 


for any fixed positive value of n. 
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The function c x /x n assumes the indeterminate form 00 /oo as x -+ co, 
but since 

e ,_ 1 + , + f! + £ + ... 


then, with m any integer greater than n. 


e 1 > 


x m 

ml 


Therefore ^ which tends to infinity with x. Hence e x /x n -*■ co 

as x -*■ oo, or e r ‘grows’ more rapidly than x n . 

Similarly it can be proved that e _I tends to zero more rapidly than 
any power of x as x -»• oo, i.e. 

x n e _I ->0 as x —► oo. (14) 

Similarly it may be proved that as x -*■ 00 , log x -*■ 00 more slowly 
than any power of x, or 

!3L*-0 as (15) 

4.4 DIFFERENTIATION OF e* AND log x 
4.4.1 The derivative of e* 

From the definition of e*, 



d 

hence ^ c kx = kt kz where A: is a constant, 

using function of a function rule. 


4.4.2 The derivative of log x 
If 

then 

and 

or 


y - log x, 
x = e v 
dx 

dy~ G 
dy_±_ 1 

dx e w x 


Hence 


a 1o 6*-7 


x> 0 


As x increases, the derivative 1/x decreases, so that the gradient of the 
graph of y = log x decreases but is always positive (see Fig. 4.2). 
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* 1 ° 

|x| is always positive so that log |x| is always defined. When x is 
negative, |x| = — x, log |x| = log (— x) so that 

i log M = JZ >°g (-*) 


Also ^log|/(x)| = provided/(x) =£ 0, 

dz 

since, writing/(x) = z, -^ = /'(x), 

StogWXII-llogWjE 


4.4.3 The general exponential function 

The general exponential function is the function a z , a > 0. If y = 
then 

logy = xlog a 

and y = e Il0B ° 


Thus, instead of a 1 the simpler function e Il0 *° can be used in calcula¬ 
tions. In particular the simpler form is more useful in differentiation, 
for example, 

4-a x = e IIOB “ 
dx dx 

_ giloga J D g a a > 0 

— c f log a. 
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To differentiate the inverse function, y = log a x, write x = a y = t yloza 

dx . 

— s= e ylog “. log a 

= jc log a 
dy I 

and -f - = —- 

dx x log a 

1 


4.4.4 Logarithmic differentiation 

In the case of differentiation of a function which consists of a number 
of factors it is sometimes simpler to take the logarithm before differen¬ 
tiating. Thus if 

y =fi(x)f,(x)f 3 (x ). . .f r (x) 
then for any value of x which gives y a non-zero value, 

In y = ln/i(x) + ln/ 2 (x) + In f 3 (x) + . . . + In f r (x) 
and differentiations with respect to x leads to 

1 dy //(*) . Uix) | f 3 (x) | fr'(x) 

y dx / x (x) + f 2 (x) + f 3 (x) + ‘ ' + fr(x) ’ 

Example (i) Differentiate x* where a is any real number and x > 0. 
Solution Now, x“ = e” Iogr since x = e logl and so 


-(^) = 3r(e“ l0gI ) 


Example (ii) Differentiate jc*, x > 0. 
Solution Let y = x* = e IlogI 
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alternatively, let y = x z 

then logj = xlog.*: 


l -^ = lo gX +l 

^=j(logx+ 1) 


= A*(logA + 1). 

Example (iii) Differentiate y = log (log x). 
Solution 


dy _ 

dx 


log A 


A log A. 

The function log (log a) is defined for (log a) > 0 and hence for a > 1. 

„ s r~ 1 dy c e2l (* 2 + !)(* + l) 3 sin A 

Example^) Find ^ rf y- (J , + 4)(3;c _ 2) - 

Solution Write 

logy = 2a + log (a 2 + 1) + 3 log (a + 1) + log sin a 

-log (a* + 4)-log (3 a-2) 

^ + 2a ^ 3 + cos a 4a 3 3 


and _ _ 

an y dx * T a 2 + 1 T a + 1 ‘ sin a 

for any value of a for which y ¥=0. 


a 4 + 4 3a - 


4,5 THE HYPERBOLIC FUNCTIONS 

There are certain combinations of the exponential functions e 1 and 
e _I which are called ‘hyperbolic functions’. They are used in solving 
certain engineering problems and are introduced now so that they may 
be used in the systematic study of integration in Chapter 6. 


4.5.1 Definitions 

The combinations + e _I ) and |(e T — e _I ) occur with such fre¬ 
quency that it is convenient to give special names to them. They have 
properties analogous to those of the circular functions cos a and sin a, 
and are called hyperbolic cosine of a or cosh a, and hyperbolic sine of x, 
or sinh x, respectively, i.e. 

cosh a = i(e z + e _I )l 
sinh a = i(e* — 


( 16 ) 
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The remaining hyperbolic functions are defined in terms of cosh x 
and sinh x as follows: 


tanh x 
coth x 
sech x 
cosech x 


sinh x _ e 1 — e 1 
coshx — e 1 + e“ x 
cosh x _ e 1 4- e~* 
sinh x — e 1 — e _I 

1 _ 2 _ 

cosh x~e* + e“* 

1_ 2 

sinh x“e*- e 1 


4.5.2 Identities 

The basic identity of hyperbolic functions is 

cosh 2 x — sinh 2 x = 1 (17) 


which is proved as follows: 

cosh x + sinh x = £(e x + e' 1 ) + £(e x — e" 1 ) = e 1 , 
cosh x — sinh x = |(e* + e _I ) — £(e* — e' 1 ) = e -1 . 
Hence the left side of equation (17) is 

cosh 2 x — sinh 2 x = (cosh x + sinh x)(cosh x — sinh x) 
= e r e -1 


Dividing equation (17) by cosh 2 x gives 

1 — tanh 2 x = sech 2 x (17a) 

and dividing equation (17) by sinh 2 x gives 

coth 2 x — 1 = cosech 2 x. (17b) 


4.5.3 Addition formulae 

There are results analogous to those of the circular functions given by 
sinh (x ± y) = sinh x cosh y ± cosh x sinh y 
cosh (x ± y) = cosh x cosh y ± sinh x sinhy (note signs) 

These follow from the definitions of section 4.5.1 and are easily proved, 
starting with the right side. 

Also sinh 2x = 2 sinh x cosh x 

cosh 2x = cosh 2 x + sinh 2 x 
but 1 = cosh 2 x — sinh 2 x 

so that, adding and subtracting respectively, 

cosh 2 x = i(l + cosh 2x), sinh 2 x = J( c °sh 2x — 1). 
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4.5.4 Derivatives 

From the definitions in section 4.5.1, 


= Me 1 + e-*) 
— cosh x. 


Similarly 


- cosh x = sinh x. 


and 


- cosech 2 x, 


each of which may be checked by the reader. 


4.5.5 Graphs of the hyperbolic functions 

(i) The properties of y = sinh a = £(e* — e -1 ) are 

(a) x > 0, sinh x > 0 

(b) x = 0, sinh (0) = 0 

(c) ^ = cosh x = |(e T + e_I ) > 0 all x, 

i.e. sinh x is an increasing function for all values of x. Also sinh x is 
an odd function since 


sinh (—x) = K e_I “ e *) = —sinh x 
(ii) The properties of j = cosh x = £(e* + e _I ) are 

(a) cosh x > 0 all x 

(b) cosh (0) = 1 

(> 0 when x > 0 


' dx 


= sinh x 


(< 0 when x < 0. 


Hence when x is negative y decreases as x increases and when x is 
positive y increases as x increases. Also cosh x is an even function since 

cosh (—x) = |(e _I + e 1 ) = cosh x. 

The graphs of sinh x and cosh x are shown in Fig. 4.3. 
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(iii) From the expression y — tanh x, 
sinh x 
y ~ cosh x 
e* — e~ x 


it follows that 

(a) tanh (0) = 0, 


2 

21 + 1 


>0 all x. 
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(c) tanh x increases as x increases for all values of x, 

(d) tanh x = 1- , ->-1 as x -* oo, 

v ’ e 21 + 1 

and 

(e) tanh x < 1 for all values of x. 

Also, tanh x is an odd function since tanh (—x) = —tanh x. 

For sketch graphs of tanh x and coth x, which is derived from that 
of tanh x, see Fig. 4.4. 



4.6 THE INVERSE HYPERBOLIC FUNCTIONS 
4.6.1 Range of the inverse hyperbolic functions 

Since x = sinhy is a monotonically increasing function for all values 
of y it is possible to define the inverse function y = sinh" 1 x (some¬ 
times written y = argsinh x) for all values of x and y. (Note, argsinh x 
but arcsin x.) 

Similarly y = cosh -1 x (or argcosh x) is defined where x = cosh y 
is monotonic increasing, i.e. when y ^ 0, and x > 1. 
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The function y = tanh" 1 x (or argtanh x) is defined for all values 
of y since x = tanh y is monotonic increasing in this range, and 
1*1 < 1 - 


4.6.2 The logarithmic form of the inverse functions 

(i) If y = sinh" 1 x, then 

sinh y — x — |(e v — e -v ). 

Multiplying by 2e v and rearranging, 

e 2y - 2xe y -1=0. 

This is a quadratic equation in e v whose roots are given by 
e v = x ± V (* 2 + !)• 

Now e v is positive so that x ± -y/(* 2 + 1) must be positive also. Since 
(x 2 + 1) > x the positive sign of the square root is chosen, and 
e y = x + V (* 2 + 1) 
or y = log [x + VO 2 + 1)] 

or sinh -1 x = log [x + V(* 2 + 1)1 for aU values of x. (18) 

(ii) Lety = cosh -1 x, then x = coshy, x > l,y > 0 and so sinhy > 0. 
Using the definitions (16), together with cosh 2 y — sinh 2 y = 1, 

e y = cosh y + sinh y = x + V (* 2 — 0 - 
Hence y = cosh -1 x = log [x + \/(* 2 — 0 ], x ^ 1. (19) 

(iii) Lety = tanh -1 x, then x = tanhy, |x| < 1, and 

e w — e -v 


or 


e 2v 


e 2v - 1 
e 2y + 1 
1 +x 
1 - x' 


Hence y = tanh 

i ii 1 + * 

'*-*“> 81 — 

1*1 < 1- 

(20) 

Similarly coth - 

1 11 * + 1 

1*1 > 1- 

(21) 


4.6.3 Derivatives of the inverse hyperbolic functions 

These are simple algebraic functions similar to the derivatives of the 
inverse circular functions. 
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(i) Let y = sinh - 1 x, then x = sinh y 
dx. 

= coshy 
= V(sinh 2 y + 1) 

= V (* 2 + 1 ) 

The positive square root is chosen since cosh y > 0 for all values of y. 
Hence 

dy 1 

dx ~ V(x 2 + 1 ) 
d 1 

or ~r sinh 1 x = all x. 

dx V(* + 1) 

Similarly 

d , , 1 

~r cosh 1 X = - 77-5-7T, X > 1 

dx V(a — 1 ) 

d t , 1 ... 

^tanh' 1 x= |*| <1 

Jtcoth-AW> 1 . 


EXERCISE 4 

1 Differentiate with respect to x the following functions. 

(i) exp (a 2 ); (ii) exp (sec x); (iii) x 3 log (1 + x 3 ); 

(iv) (1 + x 2 ) log (1 + x 2 ); (v) x tan -1 x - log VO + x a ); 

(vi) log [x + VO 2 — 3)]; (vii) sinh (2X 2 ); (viii) x log x — x. 

2 Differentiate with respect to x the following functions: 

(i) x*; (ii) (sin x) 1 ; (iii) (V*) 1 ; 

(iv) x'' 1 ; (v) x Bini ; (vi) 10 lo * 3lnl . 

3 Evaluate f'(x)/f(x) when /(x) = (1 + x ) 1 and by successive differ¬ 
entiation calculate the values of/'(0), /*(0), and /"(0). 

4 Differentiate with respect to x the following functions: 

(i) (cosh x — cos x)/(sinh x — sin x); (ii) sech x; 

(iii) x tanhr 1 x + log VO — a 2 ); (iv) tan - 1 (sinh x); 

(v) log tanh x. 

5 Sketch the curve y = tanh 2 x. 

6 Find du/dx and du/dt if a, b, and p are constants and 

u = e -ai cosh (pt — bx). 
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7 Differentiate the following functions: 

(0 2 *; 

(ii) exp (sin 2 x); 

(iii) a Vx , a > 0 ; 

(iv) sinh 3x cosh 3 x; 

(v) (x 2 + 1) 3 /(2jc - l)(3x + 2) 2 ; 

(vi) log [(1 - x)/(l + x)]; 

(vii) x 2 exp ( 1 /x); 

(viii) a x x a , a > 0 . 

8 Given that z = x y , calculate 8 2 z/8x 8y and 8 2 z/8y 2 . 

9 Determine the nature of the stationary points of the function 
x 4 e~*, and show that for all values of x Js 0, x 4 e _I < (4/e) 4 . 

10 Determine the position and nature of the stationary point of the 
function cosh 2x — 4 cosh x. 


11 Prove that, if u = x log (1 + x/y) + y log (1 + yjx) then 

8u 8u 

X Tx + yTy- U - 

12 Differentiate (i) exp (x sin x); (ii) (logx) 1 ; (iii) (xlogx)/(l - 
(iv) (log x) -1 ; (v) log (sec x + tan x). 

13 Prove that, if y = sin -1 (e' 1 ) then 


*); 


d 2 y 


dy 

sin 2 y ^ = sin y cosy. 


14 Show that, if A and B are constants and 

y = A cos (log x) + B sin (log x), 

•hen ^S + Ar l + ->’ = 0 ' 

15 Show that sec x + log (cos x) is an increasing function of x in the 
range 0 <x < ^ • 

16 Prove that, if y = tan " 1 (sinh x), then 

17 Find dy/dx where log (x 2 + y 2 ) = tan" 1 (y/x), x 2 + y 2 0. 

18 Use the definitions of cosh x and sinh x to solve the simultaneous 
equations 

cosh x + cosh y = 4 
sinh x — sinhj = 2 . 

Give your answers correct to three significant figures. 
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19 The temperature y° of a body t seconds after it is placed in a certain 
medium is given by y = 27 + 63e _{ ' 6 . Find correct to two places 
of decimals, after how many seconds the rate of cooling of the body 
will have fallen to less than 1° per second. 

20 Solve the equation 

16 cosh x — 4 sinh x = 19, 
giving the answers correct to three places of decimals. 



5 

The Mean Value Theorem, 
Taylor Series, and Further 
Partial Differentiation 


5.1 THE MEAN VALUE THEOREM 

5.1.1 Introduction 

It is geometrically intuitive that if a function y = f(x) is continuous in 
the interval a < x < b and has a derivative at each value of jc in 
a < x < b and the values of y at A where x — a and B where x = b 
are equal, then there is at least one point between x = a and x = b 
where the derivative is zero. Moreover, if the values of y at A and B 
are not equal and the other conditions hold, then there will be at least 
one point, x = c, between x = a and x = b where the tangent is parallel 
to the chord AB. The sketch of functions y = f(x) in Fig. 5.1 demon¬ 
strates this property. 



Fig. 5.1 


The graph of y = f(x) may have a vertical tangent at either or both 
of the end-points of the interval, so that for example, the function 
V(a 2 — x 2 ), ~a < x < a which represents a semi-circle satisfies the 
requirements. 

These two results may be stated precisely in the form of theorems 
as follows: 

(1) Rolle's theorem Let y = f{x) be (i) continuous for a < x < b, 
(ii) differentiable for a < x < b, and (iii) letter) = f(b). Then there is 
at least one number c satisfying a < c < b and such that/'(c) = 0. 
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(2) The mean value theorem Let y = f{x) be (i) continuous for 
a < x < b, and (ii) differentiable for a < x < b, then there exists at 
least one number c such that 

fib) —f(a) = (b- a)f(c) and a < c < b, (1) 

or f(c) = and a < c < b. (2) 

The fraction [f(b) — f(a)]/(b — a), which is the ratio of the increment 
in the value of the function to the increment in the value of x, measures 
the average rate of increase of the function in the interval b — a, and 
is represented by the gradient of the chord AB of Fig. 5.1. Hence the 
mean value theorem implies that the average or mean rate of increase 
in any interval is equal to the actual rate of increase at some point 
within the interval. For example, the average velocity of a moving point 
in any interval of time is equal to the actual velocity at some instant 
within the interval. Or, if the average speed of a moving vehicle is 
50 km.p.h. over a given interval of time, then at some instant during 
that interval the speedometer registers 50 km.p.h. 

Other ways of expressing equation (1) are as follows: 

(i) writing c = a + 6(b — a) where 6 is some number between 0 and 
1, and writing a + h for b so that b — a = h, then (1) becomes 

f(a + h) - fid) = hf(a + 6h), 0 < 0 < 1. (3) 

(ii) setting dx for h in (3) gives 

fia + dx) -fid) = dxf\a + 0 dx) 
or df= dx f(a + 6 dx) 0 < 6 < 1 (4) 

where df is the increment in the value of / due to an increment dx 
in x when x — a. 

Certain deductions follow immediately from the mean value theorem, 
as follows: 

(i) if f(x) — 0 for all values of x such that a < x < b then f{x) is 
constant for a < x < b. 

(ii) if fix) > 0 for all x such that a < x < b, then f(b) >f(a) and 
f{x) is strictly increasing for a < x < b. 

(iii) if m < f(x) < M for all x such that a < x < b, then 

m(b - a) <f(b) -/(a) < M(b - a), 

(iv) if fiix) — f 2 {x) for a < x < b then 

^[/i(*)-/ 2 (*)J = 0, 

and f x {x) — f 2 (x) = constant for a < x < b. 
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5.1.2 The generalized form of the mean value theorem 

Suppose f(x) and g(x) are two functions which are continuous for 
a < a < b and differentiable for a < x < b and that g(a) =£ gib), then 
the function <E>(x) defined by 

satisfies the conditions of Rolle’s theorem. Hence there is at least one 
number c between a and b such that = 0, i.e. 

and a <c<b. 

Moreover if/' and g' are not simultaneously zero this may be written 

m —/(g) /'(c) 

gib) - g{a) g\cj K ’ 

and this is the generalized or Cauchy form of the mean value theorem. 


5.1.3 Application to small errors 

Let y — fix ) satisfy the conditions of the mean value theorem, then 
from equation (4) 

by = bx f'(a + Obx), 0 < 0 < 1. (6) 

This is an exact equation, and gives the value of by when bx is known 
for some value x = a. If bx is small then (6) may be used to compute 
an approximate change in the value of y corresponding to a small 
change bx in the value of x when x = a. For if bx is small enough 
f\a) is an approximation for f\a + B bx) so that 

by^bxf'(a) (7) 

(See equation (18) of section 2.4.) 

Example (i) If y = \/x, calculate the approximate error in the calcu¬ 
lation of / when a is wrongly given as 1 -02 instead of its correct value, 1. 

Solution Here f{x) — y/x, f'(x) = lx' *, /'(1) = i bx = 0-02 and 
by =£= 0-01. 

Example (ii) The period T of a simple pendulum is given by 
T = IWil/g) 

where / is the length of the pendulum. If an error of f per cent is made 
in measuring I, and g is accurate, find the corresponding percentage 
error in the value of T. 

Solution This question involves percentages and it is well to remember 
that this implies percentage of something. 

Suppose the error in measuring l is positive, i.e. / is measured too 
large, then the error bl is | per cent of /, or bl/l = | per cent, and it is 
required to find bTjT. 
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T=2« V(l/g) 

log T = log 2 tt + \ log / — \ log g 
and differentiating implicitly with respect to / leads to 


ldT_ _1 

T dl~ 21 


Using 6T^ (dT/dl) d l then gives 


<5 TldT.. 
T T dl d 


= \ • i per cent, 

and the approximate percentage error in T is J. 

If the measurement of / is too small, the error is negative and this 
simply means a change of sign of dl in the working. 


5.1.4 l’Hopital’s rule 

Suppose /(x) and g(x) are two functions which are continuous in 
a < x < b, and that f\x) and g'(x) exist for each x in a < x < b, 
that g'(x) 0 and that /(a) = g(a) = 0. Then l’Hopital’s rule states 

that 


Urnm-Umm 

x-*a g (x) 


a g(x) ~ 


( 8 ) 


provided the second limit exists. For, since/(a) = g(a) = 0, 
fix) /(x) -f(a) f\c) 


g(x) g(x) - g(a) g'(c) 


and a < c < x, 


applying equation (5) of section 5.1.2. As x 

n M- 


► a+, c —*■ a+ and 


Lim J -y-{ = Lim J —~ = Lim 

x—*a+ g(x) C ^ a+ g(c) ^ a + 


fix) 

'(X)’ 


provided this limit exists. 

This establishes the rule when x approaches a from above. The case 
when x approaches a from below is proved similarly and the two cases 
together give the result (8). 

The rule (8) is used to determine Lim {f(x)/g(x)} where substitution 

of x = a in the quotient leads to the expression 0/0 which is indetermin¬ 
ate. 

Example (i) Lim X £= gj 
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log x r 0] 

Example (u) Lim j |^= 


= Lim-y^ = 1- 

The rule was established on the assumption that g'(*) ^ 0 in the 
open interval a < x < b and hence is valid when g'(a) — 0. Thus if 
f(a) is also zero then equation (8) gives another indeterminate form 
0/0 and applying l’HopitaPs rule again, 

Un m 

r-a g (x) i-*a g (x) 
provided the second limit exists. 

More generally, applying rule (8) m times 




f im Kx) 

ag <m> (x) 


provided this limit exists. Conditions on /, g and their m — 1 deriva¬ 
tives may be deduced from considerations of the mean value theorem 
and conditions for PHopital’s rule. 

In practice the method consists in differentiating numerator and 
denominator separately so long as the form 0/0 appears at x = a. As 
soon as one or other of these derivatives is different from zero at 
x = a, the limit may be found. 


, ,..., T . V0 + x) - (1 + $x) 

Example (ui) Lim-y- 


= Lim 

x ->0 


Ki +*)-»-! 
2x 


t-a 


= Lim 

x-*0 


-10 + *)-« 
2 


= -i 

It can be proved that if f(x) -*■ oo and g(x) -> oo as x ->■ a, then 
l’Hopital’s rule applies to the indeterminate form oo/oo. Proof is 
beyond the scope of this book. 

The indeterminate forms oo. 0 and oo — oo may be evaluated in 
some cases by reducing them to a form in which l’Hopital’s rule applies. 

The forms 0°, 1 ”, oo°, as limits, may also be evaluated if the expression 




can be reduced to a form in which l’Hopital’s rule applies, for example 
by considering the logarithm of the expression. 


Example (iv) Find the value of 


Lim (4--) [= 

i-». \sinx x) L 


oo—oo 
-oo + oo 


x-0+1 
x -*■ 0—J 
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Solution The expression may be re-written as 
- sinx 


Lim - 




xsmx 

and applying l’Hopital’s rule twice the limit is 

Li m 1 — cos x 
sin x + x cos x 

— Lim sin x 

x—o 2 cos x — x sin x ~ 


[-a 


Solution Writing y = x x , then 

logj = xlogx 
= log x/(l/x), 

Tl - lo S* 


and 


Hence 
so that 


[-S] 


V* 

-l/x 2 


Lim y = e° = 1. 

x-*0 + 

The assumption made in this last step is that 

Lim (log y) = log /Limj’L 
*-*o v I—o / 

which can be proved since log y is continuous. 

Example (vi) Evaluate Lim L |j= 

Solution Applying I’Hopital’s rule, the required limit 


= Lim — 

I-»a> 6 * 

= Lim 

and there is no ‘limit’, or the ‘limit is infinity’. This shows that e 1 -> 00 
more rapidly than does x 3 . See section 4.3 and example 8 at the end of 
this chapter. 
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5.2 TAYLOR SERIES 

The mean value theorem of section 5.1.1 states that if f(x) is continuous 
for a < x < b and has a derivative fix ) for a < x < b, then 
fib) — fa) = (b — a)f’(c) where a < c < b. 

Suppose now that fix) is continuous for a < x < b and that fix) 
exists for a < x < b. Consider the function defined by 

® a (*) - ®a(«) =/(*) -/W - ib - *)/'(*) 

- ^zr${m -m -(*-«)/(«)}. 

This function is zero when x = a and when x = b and satisfies the 
conditions of Rolle’s theorem. Its derivative is 

-fix) +fix) -ib- x)fix) + {fib) -fa) -ib- a)fia)}, 

i- e - {fib) -fid) -ib- a)fid) - \{b - affix)}. 

Hence there exists a value c between a and b such that this derivative 
is zero, i.e. 

fib) = fid) + ib- a)fid) + i(b - dff(c) and a < c < b. 
Extending this argument, suppose / <n_1) (x) is continuous for 
a < x < b and f n \x) exists for a < x < b and consider the function 

0„(x) — ^ where 

<5>nW =m -fix) -ib- x)f{x) - ... - - (6 (/ ~_T)! 1 / <B ~ 1)(x) - 

This function satisfies the conditions of Rolle’s theorem and hence its 
derivative is zero for at least one value of a = c between x = a and 
x — b. This leads to the result known as Taylor's theorem that 

m = fid) + ib- a)fia) + ^—^f(a) + . . . 

+ _ Q ]); / (n ~ 1) (g) + {b - n -r~ f n \°). and a < c < b. (9) 

This may be written more compactly, and slightly differently, by writing 
x for b and using the summation symbol, i.e. 

fix) = fa) + '£ ~\f m \d)ix - a) m + R n (x) (10) 

where R n (x) is the remainder after n terms and 

Kix) = ^ (x - aff'Xc), a < c < *. 
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The special case a = 0 is known as Maclauriris theorem, and writing 
c — 8x where 6 is some number between 0 and 1, 

fix) = /(0) + * m / (m) (0) + x'f'XQx). (11) 

The formulae (9), (10) and (11) are valid only if / (n, (x) is defined for 
all numbers x between a and b. T his assumption is not true for all 
functions, so that not every function can be expanded in a Taylor 
series. It is true for most of the more common functions. 

The form of the remainder R n (x) in (10) as (x — a) n / (n) (c)/«! and in 
(11) as x n f (n) (8x)/n ! is known as the Lagrange form of the remainder. 
Even if f {n \x) exists for all values of x for a < x < b and the Taylor 
series can be found for /(x), it is necessary to show that the difference 
between f(x) and the sum of the first n terms of the series, or R n (x) 
is ‘as small as we please’ as n -> oo. 

5.3 POWER SERIES EXPANSIONS OB FUNCTIONS 
The problem of representing a function by a power series is an important 
one, and can be divided into two parts. First it is necessary to develop 
a general formula to give a power series representation of a function 
of x, i.e. to give a series of the type 

a 0 + OiX + a 2 x 2 + . . . + a n x n + .... 

Secondly it is necessary to decide whether a given function can be 
expressed in this form and if so, for what range of values of x the series 
represents the function. The second part of the problem is by far the 
more difficult and is in general beyond the scope of this book. 

5.3.1 The Maclaurin series 

Let y — /(x) be the given function and suppose the power series 
expansion of the function can be found, i.e. 

f(x) = A 0 + Aix + A 2 x 2 + . . . + A n x n + ■■■ ■ (12) 

Let <f>(x) = a 0 + ayx + a 2 x 2 + . . . + a r x r + . . . for |x| < R (13) 
(see section 3.6.1), 

then 4>'{x) = a x + 2a 2 x + 3a a x 2 + . . . 4- ra r x r_1 + . . . 

4>"(x) = 2a 2 + 3. 2a 3 x + . . . + r(r — l)a r x r ~ 2 + . . . 

and <f> <n \x) = n \ a n + terms containing powers of x, for all x for 
|x| < R and for all integer values of n. Setting x = 0 in these equations 
leads to 


*» = ^ <n) (0). 


(14) 
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Hence if f(x) has an expansion (12) it must be of the form 

m +*/'«>) + o + • • •+^/ (n, (o) +... (is) 

which is Maclaurin’s expansion (11) of section 5.2. This development 
shows no more than that a condition which must be satisfied by f(x) 
is that its derivatives of all orders exist for |x| < R. By considering 
the sequence of remainders 7?i(x), R 2 (x),. . R n (x), .... it is possible 
to estimate the remainder after n terms of the expansion, and if the 
sequence tends to zero it is possible to show that the series (15) con¬ 
verges and for which values of x. 

5.3.2 Some standard series 

(i) For/(.x) = log (1 + *),/(0) = 0, then 

/<">(*) = (-D"- 1 ^ - 1)1(1 + *)- n > 

/ <n, (0) = (-lr-K* - 1)!, 

and substituting in equation (15), gives 
log(1 + x) = x-j + j- + . . • + (/" — 1)!(— l) r_ 1 + • • • 

= x ~t + j- + -. . + (-ir i 7 + . • •• 


The remainder after n terms of (15) is R n where 

R n(x) = (—!)" —; (i + Qx) n 
IPMI- W_ 


n(l + 0x) n n 
Lim |i?i(x:)| = 0. 


Hence log (1 + x) = x - y + y - + . . . + (-1)"' 
is valid 0 < x < 1. 



If x is negative, it may be proved that the series is valid for values 
of x between 0 and —1 only, so that equation (16) is valid for 
-1 < x < 1. 
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(ii) For f{x) = sin x, then 

f (n) (x) = sin (x + imr) 
and /<">(0) = sin \rm. 

When n is even, n = 2k, and/ <n> (0) = sin Ictt = 0 
When n is odd, n = 2k + 1, and/ (n) (0) = sin (2 k + Y)^n = (-l) fc . 
The Maclaurin series for sin jc is 

,_?!+£+ (^lry- 

3! 1- 5 , f . . . + (2* _ !)i + • • • 

j^2» + l 

with R 2n+1 (x) = ^ 2n + 1); sin {Ox + \{2n + 1)tt) 
and 0 < 0 < 1. This is the remainder after (2n -f 1 terms of (11). 
Since |sin y\ < 1, then 

for any fixed finite value of x, no matter how large, and hence 
Lim |/? an+1 (x)| = 0. 

Thu, ^-*-£ + £-+... + fc^^ + ... 


(iii) Analagous methods may be used to show that 
e*=l+x + 2j + ... + — + — 


cos * = 1 — — + — — 


(1 + x)“ = 1 + a.x + * 2 + • • • + x n + . . ., \x\ < 1 

and a is any real number. 

These standard series and their radii of convergence should be 
remembered and used. For example. 


is valid foi 



MEAN VALUE THEOREM, TAYLOR SERIES 89 


(iv) To find the power series expansion of sin x about jc = \tt. 

For f{x) = sin x then f in \x) — sin (x + \mi) and /(| tt ) = 1, 

I f (n> (iA = cos inn. If n = 2k then f (2k \in) = cos kn = (— l) fc , and if 

| n = 2k + 1, then f {2k + 1 Xb T ) = cos (kn + W) = 0. Using equation 

(10) of section 5.2 with a = jn, the Taylor series expansion is 


(x-in) 2 _ , (-l)"- 1 ^ - &r) 2n - 2 , 

2! + 4! * • • + (2n — 2)! + 


and 

which tends to zero as 


l*a«(*)| 


(X ~ br) 2n 


(2 n)l 


tends to infinity for all values of x. 


5.4 TAYLOR SERIES FOR A FUNCTION OF TWO 
VARIABLES 

Let co = f(x, y) be a function of x and y such that co and its partial 
derivatives up to a required order are continuous at (x 0 ,yo). Consider 
the value of f(x 0 + h, y 0 + k ). Keeping the y coordinate y 0 + k 
constant and using Taylor’s theorem of section 5.2 with a = x 0 , 
b = x 0 + h so that b — a = h, 

Ax o + h,y 0 + k ) =/(x 0 ,jo + k) + hf x (x 0 ,y 0 + k) 

+ 2\fA x o, Jo + k) + . . . (17) 

Again, applying Taylor’s theorem to the terms on the right of (17), 
Ax 0 ,y 0 + k) =/(*o,jo) + kfy(x 0 ,y 0 ) + YJ vy {x 0 ,y 0 ) + . . . 

k 2 

A(x 0 ,y 0 + k) =f z (x 0 ,y 0 ) + kf vx (x 0 ,y 0 ) + Ji/™x(*o, Jo) + • • • 
Ax(x 0 ,y 0 + k ) =f xx (x 0 ,y 0 ) + kf xyx (x 0 ,y 0 ) + . . . 

Substituting these values in equation (17) as far as the second order 
terms gives 

/(t 0 + h, y 0 + k) =/(t 0 , jo) + kf y (x 0 ,y 0 ) + || f y y(x 0 ,y 0 ) + . . . 

+ hf x (pc 0 , jo) + hkfy X (x 0 ,y 0 ) + . . . 

h 2 

+ 2 i/«(*o,Jo) + • • ■ • (18) 

This equation expresses the value of the function f(x, y) at x = x 0 + h, 
y =y 0 + k in terms of the values of the function and its partial 
derivatives up to the second order at (x 0 , Jo), together with powers of 
h — x — x o and k = j — j 0 . An expansion using higher derivatives is 
written more simply if the following ‘shorthand’ form is used. The 
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expression hf x + kf y is written (h 8/8x + k 8/8y)f and may be thought 
of as the operator (h 8/8x + k 8j8y) operating on /. Then 

(4 ♦*£)’/ 

is a short form for (h-^ + k^ operating on h^ + k^- 
The result is 

since the mixed partial derivatives are equal because they are continu¬ 
ous, and h, k are arbitrary constants. Similarly 

[ h T x + k l$f= h *f*« + + 3hk2 U + k*f yyv . 

Hence the Taylor series expansion about the point (x 0 , _y 0 ) of the 
function f(x, y) is 

f(x 0 + h,y 0 + k) =f(x 0 ,y 0 ) ^ (h~ + k^j f(x 0 ,y 0 ) + R n 

(19) 


remembering that the partial derivatives on the right are all evaluated 
at the point (x 0 , y 0 ), and where 

Rn = h{ h ^ + k A X ° + 6lh ’ y ° + ° 2k) 

and 0 < < 1, 0 < 0 2 < 1. 

If R n -*■ 0 as n -> oo, the series represents the function f(x, y) near 
(^0) To)- 


5.5 APPLICATION TO MAXIMUM AND MINIMUM 

Problems of the maximum and minimum of functions of two or more 
variables occur in many systems of physics, chemistry and engineering. 
In Chapter 2 conditions for the maximum and minimum of functions 
of one variable were developed. In this section the maximum and 
minimum of a function of two variables will be considered. Extension 
to functions of three or more variables may then be deduced. 


5.5.1 Definitions 

Consider the function co =f[x,y) defined in some region R of the 
(xy) plane. The function o> = f(x, y) has its greatest value or absolute 
maximum at a point (x 0 , y 0 ) within or on the boundary of R if the value 
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of the function at this point is greater than the value of the function for 
all other points in the region, i.e. if/(x„, j„) > /(x, j) for all (a, j) in R. 

The function f(x, j) is said to have a local or relative maximum, or 
simply a maximum at the point P (x 0 , j 0 ) in R, if its value at P is greater 
than its value at any point in the immediate neighbourhood of P. In 
other words, the value of to = f(x, y ) at all points ( x , y) near to (x 0 , y 0 ) 
and in any direction from (x 0 , j 0 ), is less than the value co 0 — f(x 0 , y 0 ). 
Thus there is a local maximum at (x 0 , j 0 ) if 

f(x 0 + h,y 0 + k) </(x 0 ,j 0 ) 

provided h and k are small enough. 

An absolute minimum or least value and a local minimum value of / 
are similarly defined. 

If the function /(x, y) has either a local maximum or a local minimum 
value at (x 0 , j 0 ) then it is said to have an extreme value at this point. 

5.5.2 Analytic conditions for an extreme value 

Suppose to = f(x, y) has an extreme value at the point (x 0 , j 0 ) in R. 
Keeping y constant and equal to y 0 ,f(x,y 0 ) is a function of x only which 
has an extreme value at x = x 0 and hence its derivative is zero at 
x = x 0 , i.e./ I (x 0 , jo) = 0. Similarly, keeping x constant and equal to x 0 , 
/(xo, y) is a function of y only which has an extreme value at y = y 0 
and hence / u (x 0 , j 0 ).= 0. 

Any point (x, y) at which / = 0 and /„ = 0 is called a stationary 
point of /, so that wherever /has an extreme value it has a stationary 
point. The converse is not true, since it is possible to find functions 
which have stationary points which do not give extreme values. For 
example, if /(x, j) = x 2 j 2 , then /, f„ and /„ are all zero at the point 
(0, 0) but this point is not an extreme one because f x and f y are also 
zero at all points on both the x- and j-axes and/is positive everywhere 
else. 

The condition f x = 0, f y = 0 determines those values of (x, y) at 
which there is an extreme value of/but this condition is not sufficient. 
Let A be the point (x 0 , j 0 ) and B be (x 0 + h, y 0 + k), where h and k 
are^arbitrary, and let f x (x 0 , Jo) = 0 = /„(x 0 , j 0 ). Then from the Taylor 
expansion near the point A, 

fix o + h, jo + k) -/(x 0 , jo) = \{h 2 f xz + 2 hkf xy + k 2 f yy } + R 3 (h,k) 

(20) 

where the partial derivatives, f xx ,f yy , and f xy are evaluated at the point 
(x 0 , jo) and R 3 is the remainder. This term R 3 tends to zero with h and 
k and is a lower order of magnitude than any of the other terms of (20). 
It may be assumed for h and k sufficiently small, that (20) may be 
replaced by 

/(x 0 + h, jo + k ) -/(x 0 , jo) = ph 2 + 2qhk + rk 2 
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where 2 p =f XI (x 0 ,y 0 y, 2q = f ly (x 0 ,y 0 ), and 2 r = f yy (x 0 y 0 ). 

The quadratic form may be written 

ph 2 + 2 qhk + rk 2 = p{h 2 + 2qhk/p + rk 2 \p) 

This is said to be positive definite for all values of h and k provided that 
pr — q 2 > 0 and p > 0 and is negative definite for all values of h and 
k provided that pr — q 2 > 0 and p < 0, and is zero only when 
h = k = 0. 

Hence if the quadratic form on the right of equation (20) is positive 
definite for all values of h and k, f(x 0 + h, y 0 + k) >f(x 0 ,y 0 ) and 
f(x 0 , y 0 ) is a minimum value of /; if the quadratic form is negative 
definite for all values of h and k, f(x 0 + h,y 0 + k) <f(x 0 ,y 0 ) and 
fix o, jo) is a maximum value of/. 

Moreover if h and k are small enough and the second order deriva¬ 
tives of f are continuous the condition 

/* = 0 =f y , f xx f yy -f%> 0 

at a point (x 0 , J D ) means that/has an extreme value at that point. This 
is a maximum when f zx < 0 and a minimum when /„ > 0. 

If at (x 0 ,y 0 ),f x = 0 =/„ and f xx f yy — f\, < 0, the binary quadratic 
on the right of equation (20) is said to be indefinite and its sign may be 
positive or negative according to the signs of h and k. This gives what 
is called a saddle point at (x 0 , y 0 ). 

If at (x 0 ,y 0 ),f x = 0 =f v and f xx f yy — f xv = 0, the quadratic form is 
said to be semi-definite and the nature of the stationary point is un¬ 
decided. Further terms of the Taylor series must then be considered, 
but in this book only conditions of second order terms are considered. 

Example (i) Investigate the extreme values of the function 
fix, y) - x 2 + xy + y 2 + x - 4y + 5. 

Solution The function f{x,y) has partial derivatives 
f x = 2x + y+\, f y = x + 2y-4 
and these are both zero at (—2, 3). 

The second order partial derivatives are all constant, f xx — 2, 
f yy = 2,f xy = 1, and the expression which determines the nature of the 
stationary point at (—2, 3) is 

fzxfyy —fxt = 4—1=3>0. 

The function therefore has a minimum at (—2, 3). Moreover this is its 
least value, since 

/(_2 + h,3 + k) >/(-2, 3) for all h, k. 
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Example (ii) Investigate the extreme values of the function 
fix,y) = (x 2 + y 2 ) 2 - 4(x 2 + 2y 2 ) + 13. 

Solution The function fix, y) has partial derivatives 

fx = 4x(x 2 + J 2 — 2), /„ = 4j(x 2 + f - 4), 

/„ = 4(3x 2 +y 2 - 2), f yy = 4(x 2 + 3 y 2 - 4), 

fxv = % x y- 

There are five stationary points whose coordinates are found by 
solving the equations / = 0 = f y , and they are (0, 0), (0, ±2) and 
(±V2,0). 

At (0, 0), f„f yy -f 2 „ = (—8)(—16) - 0 > 0 
/„ < 0, hence /is a maximum at (0, 0). 

At (0, ±2), f xx f yy - f\ y = (8)(32) - 0 > 0 
and f xx > 0, hence /has minima at (0, ±2). 

At (± y/2, 0), f xx f yy - f\ y = (16)(—8) - 0 < 0 
and / has saddle points at (± y/2, 0). 

5.6 FURTHER PARTIAL DIFFERENTIATION 

5.6.1 The increment of a function of two variables 

Consider now an approximate value of the increment of a function of 

two variables. Let co = f(x,y) and let its partial derivatives f x and/, 

be continuous functions of x and y throughout a region R in the (xy) 

plane containing (x 0 , j 0 ). Then the increment 

da> =f(x 0 +dx,y 0 +dy)-f(x 0 ,y 0 ) (21) 

is given by 

dm =f(x o + dx,y 0 + <5>’) -/(x 0 + dx, y 0 ) +/(x 0 + <$x,_y 0 ) -f(x 0 ,y 0 ) 

= dcOi + dm 2 
where 

= f(x 0 + dx,y 0 + dy) —/(x 0 + dx, y a ) (x held constant) 
and doj 2 = f(x 0 + dx,y 0 ) —f(x 0 ,y 0 ) (y held constant). 

Thus, holding y constant at y = y 0 and changing x from x 0 to x 0 + dx 
and using the mean value theorem, 

doj 2 f(x 0 + dx, jo) - fix o, Jo) 
dx ~ dx 

is the value of (3<n 2 /3x) teliVO) = f x (x ly j 0 ) for some x 1 between x 0 and 
x 0 + ^x, i.e. 

dm 2 = /(Xi, j 0 )(5x and x 0 < x x < x 0 + dx. 


( 22 ) 
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Since f x is a continuous function of x and y, its value at (xj, y 0 ) is 
nearly the same as its value at (x 0) y 0 ) when dx is small. Hence the 
difference denoted by e x and given by 

/*(*i>/o) -fx(x 0 ,y 0 ) = 

approaches zero as dx approaches zero. Substituting for y 0 ) in 
equation (22) gives 

do> 2 = {f z {x 0 ,y 0 ) + s 1 } dx, (23) 

and e ± -*■ 0 when dx -> 0. 

Similarly dcox =f y {x 0 + dx, y 0 + 6dy} dy = {f y (x 0 , y 0 ) + e 2 } 6y (24) 

where 0 < 6 < 1 and e 2 -+■ 0 when dy -*■ 0. Substituting from equations 
(23) and (24) in den = da> 1 + dm 2 gives 

day = f x (x 0 , y 0 )dx + f y (x 0 , y 0 )dy + dx + e 2 dy (25) 

where £i and e 2 both approach zero when dx, dy approach zero. 

An analagous result holds for a function of any finite number of 
independent variables. For the case of a function of three variables 
c° —f(x,y, z), 

day =f x dx +f y dy + f z dz + £i dx + e 2 dy + £ 3 dz 
where e u e 2 , e 3 -» 0 when dx, dy, dz -*■ 0. 


5.6.2 Total differentiation 

Let co = f(x, y) be differentiable at all points along some curve C in a 
region R in the (xy) plane and let the equation of C in terms of the 
parameter t be x = x(t), y = y{t) where x and y are differentiable 
functions of t. Then co is a differentiable function of t and 

dm dm dx da> dy 

!i = !)xdi + dy7t (26) 

This follows from equation (25) where the parameter t has value t 0 at 
the point (x 0 , y 0 ) and when dt —> 0, dx and dy both -» 0 and e L and e 2 
both -> 0. 

The formula (26) may be extended to give the total derivative of a 
function of n variables f(x x , x 2 , x 3 ,. . ., x„) where x u x 2 , x 3 ,. . ., x„ 
are differentiable functions of the parameter t. Then 

d7_df_dx 1 ^df_dx 2ju dLd_x 3 ^ tf_dxn n , 

dt ~ 8 Xl dt ^ 8 x 2 dt 8 x 3 dt + ‘ + 8x n dt ’ K ) 
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A particular case arises when f(x,y ) is a constant. The equation (26) 
then gives 

n - d l d -^ . 

dx dt " 1_ dy dt 

which leads to 

dy _ dyjdt _ dfjdx 
dx dx/dt SJJdy 

The latter result is the derivative of y with respect to x when x and y 
are related in the implicit form f{x,y) = constant (see also section 
2 . 2 . 11 ). 


Example Find dy/dx given that jc 5 + 4 x 2 y 3 — 3 y 5 = 2. Use this result 
to find the equation of the tangent to the curve * 5 + 4x 2 y 3 — 3 y 5 = 2 
at the point (1,1). 

Solution Here f(x, y) = x B + 4x 2 y 2 — 3 y 6 , and hence 
dy _ 5x 4 + 8Ay 3 
dx I 2 x 2 y 2 — 15 y*' 

The equation of the tangent to the curve at the point (1, 1) is given by 


or 

which is 


^" i)== (SL ( * -i) 

O' - i) = (+ 7 ) (* -1) 

3 y - I3x + 10 = 0. 


5.6.3 The chain rule 

No essential complication is introduced by considering the derivative 
of the function 00 = fix, y) where * and y are functions of two para¬ 
meters r and s, i.e. x — x(r, s ) and y = y{r, 5 ). Then when s is held 
constant and writing 


the result is the two formulae 


d(o 

~8~r 


etc. 


8 w 8(0 dx 8 (o 8 y \ 
dr ~ dx dr + dy dr I 
8(0 _ 8(0 dx (ho dy I 
8 s ~ dx 8 s ^ dy 8 s ) 
More generally suppose to = J(y u y 2 , y 3 ,. . 


(28) 


• ,y n ) and y x ,y 2 ,y z . 
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. . ., y n are each functions of m variables x x , x 2 , x 3 ,. 
y k = yk(x 1 , X 2 , x 3 , . . x m ), k= 1, 2, 3,. . n, then 
day _ dco 8y t 
8 x r , 8 y t 8x r 


( 29 ) 


This rule is the chain rule of differentiation and may be compared with 
the function of a function rule of section 2.2.9 of Chapter 2. 


Example Given that m = f(x,y) is transformed to F(r, 0) by substitu¬ 
ting x = r cos 0 and y = r sin 6 , find dfldx and dfjdy in terms of r, 6, 
dF/dr and 8F/80. 


Solution First set of variables: x, y 
Second set of variables: r, 6 
(a) 8 f/8x and dfjdy are to be found 


* dx 8y . dx 

) Tr- cme ’fr = sme -1 5 

I d JL- d l d JL 


SF _dfdx Sfdy 
86 ~ dx 86 + 8y 86 
8 F df 




0) 

CO 


«9=- ,slnS to + rcos9 i 

The question requires dfldx and dfldy which can be found in terms of 
r, 6, dF/dr, and 8 F/86 by solving (i) and (ii) as simultaneous equations. 
Multiplying (i) by r cos 6 and (ii) by sin 6 and subtracting gives 


0/_ 


„ 8 F 


8 F 


(30) 


and multiplying (i) by r sin 6 and (ii) by cos 6 and adding gives 

d l 


8 y~ 


8 r 


Notice that when x — r cos 0, y = r sin 6 
a different function F(r, 6) is obtained. 

Equations (30) and (31) may be written in operator form, 

5 a 8 1 • „ 3 

~~ 6 -r -sin 


(31) 

substituted in f(x,y) 


8 x 


8 r 


86 


(32) 


1 


- = sin 6 -y- + - cos 6 73 - 


(33) 
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Since r 2 = x 2 + y 2 and tan 0 = y/x, it is possible to evaluate df/dx 
directly as 

¥ _ dFdr_ 8F80 
dx dr 8x + 86 8x 

and this is left as a useful exercise for the reader. 


5.6.4 Transformation of second order derivatives 


Many applications involving second order partial derivatives are 
simplified by a change of variable. Suppose <w = f(u, v ) becomes 
F(x, y ) on substitution of u — u(x, y), v — v(x, y) where x, y are 
independent variables and u, v are differentiable functions of x, y. 
Suppose also that the second order partial derivatives f m and f vu are 
continuous and so equal. By equation (28) 


8F_8fdu dfdv 
dx du dx dv dx 

or the operator 

8_8v 

dx ~ dudx^ dv dx 
Using operator (35) on equation (34) 
d_ td_F\ _ 8_(c£du dfdv\ 
dx \0x / dx \8u dx dv dx) 


(34) 

(35) 


- (« 


du df d 2 u d 


fa* + fa ( d f! dv ) fa + 


dx du dx 2 _r dx 


dfdh 
dv dx 2 


df d 2 u df d 2 v du (d du 8 x dv) 

= du dx 2 + dv dx 2 + dx \du dx + dv {d ^ d ) dx) 


df d 2 u df d 2 v ^ d 2 f ^ du'' 
= 8udx 2 + dvdx 2 ' 1 ' 


dv (d du d dv) 

+ fak^rx + Tv^^} 

d*f (8u\ 2 ..S^Sudv dfftdv\ 2 
du 2 \8x) dudv dx dx ' 8v 2 \8x) 


Similarly, 
8 2 F 


d 2l = d l d 2a , d l d 2i + ( d j)Y , 2 , ey /m 2 

8 y 2 du dy 2 dv 8y 2 du 2 \dy) dudv 8y dy dv 2 \8y/ ' ' 


and 

d 2 F 


df d 2 u 
dxdy du dxdy 


df d 2 v d 2 f du du d 2 f /du dv du dv\ 

dv dxdy du 2 dy dx dudv \dy dx dxdy) 


d 2 f dv dv 
dv 2 dy dx 


(38) 




98 


MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


The formulation of equations (37) and (38) are left as exercises for the 
reader. 

Example Find 8 2 fjdx 2 if to = f(x, y) = F(r, 6) where x = r cos 8 and 
y = r sin 6. 

Solution Using operator (32) to find d 2 f/8x 2 gives 
8 ldf\ / „ S 1 . a \ / „ 8F 1 . . 8F\ 

to (to) " ( cose Fr-T sm9 58) ( cos0 07-7 sin0 ee) 

,8 1 „ 8F 1 . n 8F\ 

= cos0-^cos0-- 7 sm0^j 

1 . „ 8 I 8F 1 . n 8F\ 
~~r Sm6 86 [ C0S 9 Tr~~r Sin6 8dj 

= cos 6 [ cos 6 -8? + ^ 1sm 9 re ~ 7 sm 9 m) 

— j sin 6 ^—sii 


,8 2 F 2 

e -8?-~r C 


868r 

1 . 

8 2 F 1 . „8 2 F 


I) 


8r86 


+ 7 2Sind W 2 


Similarly f vv and f xv may be evaluated. 


EXERCISE 5 

1 By using the mean value theorem or otherwise, prove that for every 
positive number /? 

Pe~ B < 1 — e - * < (i. 

2 If in the observed values of the variables x, y, z there are percentage 
errors of +1, —2, +1*5 respectively, find approximately the per¬ 
centage error in the value of co calculated from the formula 

co = 27x 5 / /4 /z 4 - 

3 The area A of a triangle of sides of lengths a, b, c is given by the 
formula A = ^{s(s “ a )( s ~ b)( s ~~ c )} where s = l(a + b + c). 
A plane triangular lamina of sides a — 5, b = 6, c = 7 units is to 
be cut from a plane piece of metal. If the side b is cut exactly, but 
small percentage errors a, y are made in the sides a, c respectively, 
show that the percentage error in the area is approximately 
(125a + 49y)/144. 

4 The volume of a cone is obtained by measuring its height h and its 

semi-vertical angle a and then using the formula V = %nh 3 tan 2 a. 
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If the measurements of h and a are subject to small percentage 
errors p h and p a respectively, find an approximate expression for the 
resulting percentage error in V if a is approximately \tt. 


5 Evaluate the following limits: 


(ii) Lii 


sin x — tan x 


(iv) Ii 


(v) 1 


X 2 + X 


„ 2x 2 + r 

6 Evaluate the following limits: 

„ log X - (x - 1) + K* - l) 2 . 


Clii) 

x-0 log* ’ 

, ^ t tan x — x 
(vi) Lim y _ ■ 


(i) Lim - 


(x - l) 3 


(ii) Lim - 


7 Evaluate the following limits: 

. sinh x — x . sec 3 x 

j_*o 1 — cos x sin x 

(v) Lim x{V(* 2 + 1) - x}. 


8 Use FHopital’s rule to prove 

Lim = 0 and Lim = 0 
(see section 4.3). 

9 The function/(x) is defined by/(x) = ^ when x ^ 0,/(0) = 1. 

Use the definition of the derivative of a function as a limit to 
evaluate /'(0). 

10 Derive a power series expansion for log (1 + x) stating its range of 
validity. Show that 

tanlr 1 x = | log [(1 + x)/(l - x)], |x| < 1. 

Hence find the first three non-zero terms in the expansion of 
(i) tanh" 1 x, (ii) tanh -1 (sin x). 

11 Write down the series expansion of log (1 + x) and of cos x in 
ascending powers of x and show that, when x is small, 

log cos x = — \x 2 — x*/l2 . . .. 

12 Write down the Taylor expansion about x = 0 (Maclaurin expan¬ 
sion) for the following functions, stating the general term and the 
range of values of x for which your series is valid: 

(i) sinx; (ii) sin (6 + x); (iii) e CI ; 

(iv) (1 — x) -1 ; (v) (1 — x)“ when « is not a positive integer. 
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13 Write down the Maclaurin series for the function log (1 + \x 2 ) 
stating the general term. For what values of x is the series valid? 

14 The Taylor series for a function of two variables about the point 
(a, b) may be expressed in the form 

f(a + h,b + k ) =f(a, b) + Jr ^ j/i ^ + k f(a,b). 

Write a + h = x and b + k = y and obtain the Taylor series about 
the point (0, 0) for the function e 1 cos y as far as the terms of the 
second degree in x and y. 

15 Test the following functions for maxima, minima and saddle points: 

(i) x 2 + y a — 2x + 4y + 8; (ii) x 2 - y 2 - 2x + 4y + 8; 

(iii) 5 + 2x + 2y — 2x 2 - 2xy - y 2 . 

16 Show that the function /(x, j) = x 2 + xy + y 2 + x — 4y + 9 has 
an absolute minimum at the point (—2, 3). 

17 Show that the function 2x 4 — 3 x 2 y 2 + y i + 8x 2 + 3 y 2 has sta¬ 
tionary values at the points (0, 0), (5, ±6), (-5, ±6). Investigate 
the second order properties of the function at each of these points. 

18 Determine the maxima and minima of the function 

f(x,y) = (ax 2 + by 2 ) exp [-(x 2 + /)] 
where a, b are constants and 0 < a < b. 

19 Determine the position and nature of maxima, minima, and saddle 
points on the surface 

z = x 2 + y 2 — £x 4 . 

20 Show that the surface 

z = x 2 — xy + y 2 + 2x + 2y — 4 

has a minimum at the point (—2, -2, —8) and no other maximum 
or minimum. 

21 Determine the minimum value of the function 

3(x + yf + 2(x - y) 2 - 8x. 

22 Show that the function / = (x 2 + y 2 ) 2 + 2 a\x 2 - y 2 ) where a is a 
constant, has one saddle point and two minima. 

23 Show that the function f(x, y) = x 3 + 2x 2 y — y 2 has a saddle 
point at (—f, -eg). 

24 Show that the function 

f(x, y) = (x + y) 3 + (x - y) 2 - 12(x + y) 
has a minimum at the point (1, 1, —16). 
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25 Prove that, if x = r cos 0, y = r sin 6 and F(r, 6) =f(x,y), then 

Frr + ~F r + J^F ee = fzz + fyy 

Hence, or otherwise, show that f(x, y) = tan" 1 (y/x) + log (x 2 + y 2 ) 
satisfies the equation f xx + f yy = 0. 

26 Change the variables from x, y to u, v where u — x + y and 
v = x/(x + _y) and show that x log (x + y) — x + tan" 1 {xj(x + v)} 
satisfies the equation 

xf z + yf y = x\o%(x + y). 

27 By changing the independent variables from x, y, z to u, v, w where 
u = x, v = x + y/x, w = x + yz show that the equation 

xyfx +y(y - x 2 )f y = (x + yz- x 2 z)f z 
transforms to xyF u = 0 where f(x,y, z) = F(u, v, w). 

28 If z is a function of u and v where u = x 2 — y z , v = 2 xy, show 
that (i) xz x + yz y = 2(uz u + vz v ), (ii) yz x — xz y = 2 (yz u — uz v ). 
Show also that if 

z uu + z m = 0, 
then z„ + z yy = 0. 

29 Find dy/dx at the point (—1,1) given that x 3 — 2 y 3 + 3 xy 2 + 6 
=0. 

30 Show that the tangent to the curve 2x + x log 2x + y e* = I at 
the point (J, 0) has the equation 2y + 6x = 3. 
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Integration 


6.1 THE INDEFINITE INTEGRAL 

The problem is to find a function y = F(x) such that its derivative in 
some given range, say a < x < b is a given function f(x). In symbols 
this may be written 

g iven % =/(*). (1) 

find y = F{x). (2) 

For example, if fix) = 2x then one solution is y = x 2 . On the other 
hand y = x 2 + 2, y = x 2 — 7m are also valid solutions, and so also is 
y = x 2 + C where C is any constant. 

An equation such as (1) which specifies the derivative as a function 
of x (or as a function of x and/) is called a differential equation. Differ¬ 
ential equations of more general types will be considered in Chapter 11. 

A function y = F(x) is called a solution of the differential equation 
(1) if, over the range a < x < b, F(x) is differentiable and 

F\x) =f(x). (3) 

In these circumstances F(x) is called an integral or a primitive of fix) 
with respect to x. 

If Fix) is an integral of fix) then F(x) + C is also an integral where 
C is any constant. For if equation (3) is satisfied, [Fix) + C]' = Ffx) 
= fix). Moreover all integrals of fix) may be expressed in the form 
Fix) + C. For let Ffx) and F 2 (x) be two functions which satisfy 
equation (1), and using deduction (iv) of section 5.1.1, then Ffx) 
and Fffx) differ by a constant. The precise value of C cannot be 
determined without further data and hence C is called an arbitrary 
constant. This is indicated by writing 

J/(x) dx = F{x) + C (4) 

where F{x) is any primitive of f(x) and C is an arbitrary constant. The 
symbol J is called an integral sign and J . . . dx is interpreted as the 
inverse of the symbol d/dx. 

6.2 STANDARD FORMS 

Integration as defined above requires the ability to guess the answer. 
The first requirement for skill in integration is a thorough knowledge 
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of differentiation. The following list contains formulae for derivatives 
together with the corresponding indefinite integral. Arbitrary constants 
of integration are omitted, a and n are constants and only the simplest 
form of integral is given. 


d 

dx *“ ~ H 


I* x n dx = n £ -1 

(5) 

d 

1 

f dx 




1 ~ = logW 

(6) 

i- e 1 = e r 

dx 


|* e 1 dx = e 1 

(7) 

d 


r a 1 


Tx aX = an °g fl 

i axdx= w a > 0 ’ a ^ 1 

(8) 

d . 

dx sm x ~ 

COS X 

j* cos x dx = sin x 

(9) 

d 

~~7~ cos X = 
dx 

■■ — sin x 

j* sin x dx = — cos x 

(10) 

d 

j- tan x = 
dx 

sec 2 x 

j* sec 2 xdx = tan x 

(11) 

II 

* 

8 

*«l-3 

■■ — cosec 2 x 

j* cosec 2 xdx = —cot x 

(12) 

d 

1 

C dx 


~r sin -1 x 
dx 

” VO - * 2 ) 

j V( 1 _**) = ”> '*■ w<‘ 

(13) 

d 

1 

f dx 


-j- tan -1 x 
dx 

- l+x* 

T~, — 2 = tatl " x 
} 1 + x 2 

(14) 

d 

~r cosh x 
dx 

= sinh x 

j sinh xdx = cosh x 

(15) 

d ■ u 
~r sinh x ■■ 
dx 

= cosh x 

j* cosh xdx = sinh x 

(16) 

d 

-y- tanh x 
dx 

= sech 2 x 

j' sech 2 xdx = tanh x 

(17) 

d , 

~r coth x 
dx 

= — cosech 2 x 

j* cosech 2 xdx = —coth x 

(18) 

d . 

1 

r dx 


~r sinh -1 , 
dx 

* “ VO + x 2 ) 

—,- 71 "-.-~ = sinh 1 X 

) VO + X 2 ) 




= log \x + VO + * 2 )| (19) 

d , , 

1 

f dx 


-j- COSh” 1 
dx 

* - V(X 2 - 1) 

1 n- cosh 1 x 

J V(x 2 - 1) 



= log |x + v/(* 2 ~ 1)|, x > 1 (20) 
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f dx 

Jrrp- 


= * lo s|nz^|’ * 2 < 1 ( 21 ) 


d , , -1 

■j- cotlr 1 x = —5 - 7 

ax x z — 1 


r 

—x - 7 = — coth“ 

J A 2 - 1 

= 2 lOg 1^ 


- 1 

+ 1 ’ 


6.2.1 Rules 

The following rules help to reduce the amount of guesswork in integra¬ 
tion in many cases and lead to further standard forms. Arbitrary 
constants of integration are omitted. 

(al It follows from section 2.2.5 that 

J (a/(x) + bg{x)} dx = a J /(x) dx + bj g(x) dx (23) 

where a and b are given constants. This rule may be extended to the 
integral of the sum of any finite number of functions. 

(b) From equation (9) of section 2.2.9, 

J flax + b) dx = ^ F(ax + b), a y 0 (24) 


where a and b are known constants and F'(x) = /(x). 

Example (i) Jsin ( 3 x + 2) dx = — J cos ( 3 x + 2 ). 

Example (ii) J*( 5 x + 3 ) _ 1 dx = |\/( 5 x + 3 ). 

These results follow from equation (24) directly and compare with 
examples (i) and (ii) of section 2.2.9. 

The following are some standard forms which follow from equation 
(24), a, b and n are constants and a # 0. 


J (ax + b) n dx = ~ 


n =£ —l 


- log |ax + b\, ax + b =£ 0 


I ax + b ~ 
[ e aI dx = 


(25) 

(26) 
(27) 


cos (ax + b) dx — - sin (ax + b) 


(28) 
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dx (x 

VV 2 - * 2 ) V> 

i)- Ist<' 

(29) 

dx 1 , / x\ 

-o-t —5 = - tan -1 I -1 
a 2 + x 2 a W 


(30) 

dx . , , / 

— ,7 o ,-^ = Sinh 1 

V(a 2 + * 2 ) \ 

= log |x + V(a 2 + V>| 

(31) 

// 2 o\ — cosh 1 1 

VC* - a 2 ) ' 

(j) = log \x + V (* 2 - a 2 ) |, 



|i| >1. 

\a\ 

(32) 


J ^{/(*)} n+1 = (« + 1 ){/(*)}"/'(*), 

using the rule for differentiating the function of a function, then, when 

h^-1, 

/ {/(*))"/'(*) dx = ^{/(x)}—, (33) 

and when n — — 1 , 


J^^ = log |^)| (34) 

provided /(x) ^ 0 . 

Example (i) J* <fx = |J(sin 2 x )“ 5 (2 cos 2 x) </x 

= (sin 2 x)“ 6 (sin 2 x )' dx 
= K-l)(sin 2 x)- 4 . 

Example (ii) J tan <ix = J dx 

= -\^§dx 
= —log |cos x| 

i.e. J tan xdx = log |sec x\. ( 35 ) 

Example (iii) J cosec x dx = J 

= f_ 

J 2 sin |x cos £x 
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Multiplying numerator and denominator of the integrand by \ sec 2 \x 
makes this integrable in the form of equation (34), so that 

f , f i sec 2 lx 

J J tan jx 

— log I tan |a|. (36) 

Example (iv) J sec x dx may be evaluated in one of two ways. 

First, using equation (36) and the identity cos x = sin (a + r), 

Jsec x dx = Jcosec (a + %n) dx = log |tan (|x + Jtt) | (37) 

Second, writing 

_ sec x (tan x + sec a) (tan x + sec x)' 

~ (tan a: + sec x) ~ (tan x + sec x) 
and using equation (34), 

J sec x dx = log | tan a + sec x\ (38) 

Example (v) J cos 3 xdx—j cos 2 x cos x dx = J( 1 — sin 2 a) cos a dx 

=Jcos x dx — J*(sin x) 2 (sin a)' dx 
— sin x — J sin 3 x. 


This method applies whenever an odd power of sin x or cos * is to be 
integrated. 

Example (vi) J cos’" a: sin" x dx, m,n integers. 

If one of m or n is an odd integer the method of example (v) may be 
used to evaluate the integral. 

If both m and n are even integers or if one is an even integer and the 
other is zero, it is necessary to use the identities cos 2 A = £(1 + cos 2A) 
and sin 2 A = ^(1 — cos 2A). 

For example, if m = 0, n = 2, 

J"sin 2 x dx = jl(l — cos 2a) dx = lx — l sin 2a, 

and if m = 2, n = 4, 

cos 2 a sin 4 a = (2 — cos 2a — 2 cos 4a + cos 6a)/32 
and the integral is ~rsx — A sin 2a — sin 4a + 1 I 2 sin 6a. 
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Example (vii) To integrate 

sin mx cos nx, sin mx sin nx, and cos mx cos nx, 

where m and n are integers it is necessary to use the trigonometrical 
formulae 


sin A cos B = | [sin (A — B) + sin (A + B)], 
sin A sin B = \ [cos (A — B) — cos (A + £)], 
cos A cos B = | [cos (A — B) + cos (A -f B)]. 


Thus for example, if m =fcn, 

J cos mx cos nx dx — J |[cos (m — n)x + cos (m + n)x] dx 
, [sin (m — n)x sin (m + ri)x~\ 

“*L m-n + m + n J’ 

and if m = n, 

J cos 2 nxdx = J J(1 + cos 2 nx) dx = £[x + (sin 2 nx)/(2n)\. 


6.3 THE DEFINITE INTEGRAL 

Let f(x) be defined in the interval a < x < b and let F(x) be continuous 
in a < x < b and such that F’(x) = fix) in a < x < b. Then from the 
mean value theorem (1) of section 5.1.1, 

F(b) — Fla) = lb — a)F'lx) = lb — a)flx) and a < x < b. 

This is written 


^flx)dx = Flb)-Fla) (39) 

and is defined to be the definite integral of fix) in the interval from 
x = a to x = b. 

6.3.1 Geometrical interpretation of the definite integral 

Suppose the function fix) is positive and continuous in a < x < b. 
Using an intuitive idea of area, consider the area enclosed by the curve 
y = fix), the x-axis and the ordinates at a and x, where x is some 
point between a and b. This area is a function of x and may be written 
A(x). The problem is to show that A'lx) = fix), a ^ x < b. 

Using Fig. 6.1, Jet Q be the point between a and b on the x-axis whose 
x-coordinate is x + h, then the ordinate QR is fix + h). The area 
between the curve, the x-axis, and the ordinates at a and x + h is then 
A(x + h) and the area, shaded in Fig. 6.1 is A(x + h) — A lx). 

Let M be the greatest and m the least value of fix) in the range 
from x to x + h. Then, using the lettering of Fig. 6.1, 
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area of rectangle PQRS < A(x + h) — A(x) < area of rectangle PQTU 
i.e. mh < A(x + h) — A(x) < Mh 



Now, as h -* 0, x being held fixed, both m and M ->■ /(*), since / is 
continuous, and hence 


Lim 

A—0 


j4(x + h) — A{x) 
h 


=/W, 


^'(*) =/W- 


(40) 


Let F(a) be any indefinite integral of f(x) so that F'(x) = f(x). Since 
F(x) and A(x) have the same derivative they differ by a constant, i.e. 
yl(.v) — F(x) = k, where k is some constant. When x — a, A(a ) = 0, 
hence k = —F(a) and 


or 


A(x) = F(x) - F(a), 

A(x) = Jf 1 f(x) dx = F(x) - F(a), (41) 


using the notation of (39). 

It is usual to denote F(x) — F(a ) as [F(jc)]£ where the square bracket 
with its limits of integration a and x, is an instruction to substitute and 
subtract. 

The total area enclosed by the curve y = f(x ), the x-axis and the 
ordinates at x = a and x — b is hence F(b) — F(a) or f fix') dx and is 
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usually called ‘the area under the curve’ from x = a to x = b. More¬ 
over it follows that 

£ f(x) dx = £ f{x) dx + £ fix) dx where a < c < b. (42) 


6.3.2 The ‘sign’ of an area 

When/(x) is negative and b > a > 0, since 

£ -/(*) dx = Fid) - Fib) = - £ fix) dx, 

then the integral is negative. This is explained geometrically by attaching 
a negative sign to any area below the x-axis. 



More generally if y = fix) takes positive and negative values in the 
interval a < x < b as does the function represented by the curve in 
Fig. 6.2 where the function is negative for c < x < d and positive 
elsewhere in the interval, from (42), 

| fix) dx = J fix) dx + ^ fix) dx + £ fix) dx 

= the algebraic sum of the separate integrals. 

Hence the area enclosed between the curve and the x-axis and the 
ordinates at x = a, x = b is the sum of the moduli of the separate 
integrals, i.e. 

arCa = || ^ dX \ + |I ^ dX \ + |I ^ dX \ 

Example J cos x dx — j^sin xj = sin -n- — sin 0 = 0 , 

but the area enclosed between the curve y = cos x, the x-axis, and the 
ordinates at x = 0 and x = tt is 


J cos x dx — J ( 


*r 


cos x dx = 2. 
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6.4 INTEGRATION OF THE RECIPROCAL OF A 
QUADRATIC 


6.4.1 The integrand of the form l/(x 2 + px + q) 

Consider the integral 

J + d p X + q > P4 8 iven constants. (43) 

Since x 2 + px + q = (x + ip) 2 + q — ip 2 , the form of the integral 
depends on the value of q — \p 2 and there are three cases to consider: 

(a) If p 2 = 4q, x 2 +px + q = (x + ip) 2 and 

r dx r _ dx _ i 

J X 2 +px + q ~ J (x + ip) 2 ~ (x + ip)’ 

(b) if p 2 <4 q, (x + ip) 2 + q — ip 2 ~ (x — a ) 2 + /3 2 where a and 

are real. Then from (30) of section 6.2.1 

f dx r dx 1 
J x 2 +px + q~ J (x - «) 2 +/? 2 “/? tan 1 P )’ ( ) 

(c) if p 2 > 4q, x 2 + px + q has real factors and the integral may be 

evaluated using partial fractions. Suppose 

x 2 +px + q = {x - <x)(* - j 8 ) 


where a and /3 are real, then 

, V(« ~ P) _ l/(« ~ P) 

x — a x — P 


(x - a)(A - ft 
dx 


r dx f dx 

j x 2 +px + q~ j (x-*)(x- 


P) ~ « “ 0 


(45) 


6.4.2 The integrand of the form (ax + b)/(x 2 + px + q) 
Consider the integral 

f (ax + b) dx 

J x 2 + ~ px + q ’ P’ 1 S lven real constants - 


(a) It is possible to choose constants A, p such that 
ax + b = l(2x + p) + p. 


i.e. A (derivative of the denominator) + p. Hence A = ia, p = b — \ap, 
then 


' (ax + b) dx . C (2x + p) dx f dx 
x 2 + px + q~ J x 2 + px + q + P J x 2 + px + q 


(46) 


The first integral is A log jx 2 + px + q\ and the second has already 
been dealt with in section 6.4.1. 
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(b) When the denominator x 2 + px + q can be resolved into real, 
distinct factors it is easier to write (ax + b)/(x 2 + px + q) in its partial 
fractions immediately. Suppose x 2 + px + q = (x — a)(x — p), a, /? 
real, then write 


ax + b = A B 

(x - o)(x - P) ~ x - a + x - p 


(47) 


where A, B are constants to be found. The constants may be found by 
one of three methods, (i) the ‘cover-up’ method; (ii) the ‘identity’ 
method; (iii) the ‘comparison’ method, which are explained as follows: 


(i) The ‘ cover-up ’ method 

Multiply both sides of (47) by x — a to give 

ax + b _ B(x — a) 

x~P = + x-P ’ 

and set x = a so that A = (act + b)/( a — p). This is equivalent to 
‘covering-up’ the factor x — a on the left of (47) and writing x = a 
in the remaining fraction to give the value of A. Similarly B is found 
to be (ap + b)j(p — a) when the factor x — /? is covered up on the 
left of (47) and ft substituted for x in the remaining fraction. 

This method is very quick and always works when the factors in the 
denominator are simple distinct linear factors. 


(ii) The ‘identity method 

Consider the identity, supposed true for all values of x, 

ax + b = A(x — P) + B(x - a) (48) 

obtained from (47) by multiplying by (x — a)(x — P). In (48) set 
x = a. and A = (aa. + b)j( a — p), set x = p and B = (ap + b)/(p — a). 


(iii) The ‘ comparison’ method 
Write (48) in the form 

ax + b = (A + B)x + (-Ap - Bol), 

and equate coefficients of x and constant terms to give two simultaneous 
equations, a = A + B and b = —Ap — Bo. from which A and B are 
found. 


6.4.3 The integrand of the form /(x)/(x 2 + px + q) 

When the function f(x ) is a polynomial of degree two or more it is 
necessary to divide by the denominator until the integral is of the form 
(ax + b)j(x 2 + px + q). The integral is then evaluated using one of 
the methods outlined above. 
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_ , f dx C dx 

Hxample (,) j 2 _ - _ p - J (1 _ , X 2 + ,) 

-J{nb + rN»* 

= * >°e 

f dx r dx — 1 

Exm,ple („) j 4;c ,_ 4j[+1 = J = 2(2^)’ 

, ..... f dx r dx 

Examp,e (,„) j l _ x+x ,=J (Je _ 1)s+} 

2 , / 2 x - 1 \ 

= V3 tan 1 V3 /' 

Example (iv) 

J 1 - X + .V 2 J 1 ~ X + X 2 

= ^3 tan - 1 i log (1 - x + x 2 ). 

= £ log |x - 2| + f log |x + 1 J 

Example (vi) 

" J{‘ - TT X + 3 - ( ~- Tf) ~ 5 + 2 } dx 

= x - log(x 2 + 2x + 3) - ^2 tan- 1 

6.5 INTEGRATION OF RATIONAL FUNCTIONS 
Consider the integration of the rational function 

™ (49) 

R(x) 

where P(x) and R(x) are polynomials in x, R(x) is not necessarily a 
quadratic and the factors of R(x) are known. The procedure is as 
follows: 
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(a) if the degree of P(x) is not less than the degree of R(x), divide the 
denominator into the numerator until the remainder is of the form 


fix) 

m 


(50) 


where the degree of f(x) is less than the degree of R(x); 

(b) separate the remainder (50) into its partial fractions. Let R(x) = 
(x — a)(jt — fi) m (x 2 + px + q) n , where m and n are positive integers, 
so that (50) becomes 

_yw_ 

(x - a)(x - pr(x 2 +px + qY 

and the degree of f(x) is less than m + 2n + 1. Then the following rules 
apply: 

(i) to each distinct linear factor such as x — a, there corresponds a 
fraction A/(x — a) where A is a constant to be found by one of the 
methods of section 6.4.2 (b); 

(ii) to each repeated factor such as (x — p) m there corresponds m 
fractions of the form 


* - p + (x - w + (x - pr + • • • + ix - /?)-»’ 
and B„ i = 1, 2, 3,. . ., m are constants to be found. The ‘cover-up’ 
method may be used to find the value of B m ; 

(iii) to each quadratic factor such as (x 2 + px + q) n there correspond 
fractions of the form 

C lX + D t C 2 x + I> 3 C n x + D n 

x 2 +px + q + ix 2 +px + q) 2 + ' ’ ' + (x 2 + px + q) n ' 
where C { , D { , i = 1, 2, 3,. . n are constants to be found. 

Example Express 

,, v _ x i — x 2 — \ 

(x - 2)(x 2 + 2x + 3) 

as a sum of partial fractions and hence evaluate J f(x) dx. 

Solution Since the degree of the numerator is greater than the degree 
of the denominator, it is necessary to divide so that 

4 . 6x ~ 1 

X + {x- 2)(x 2 + 2x + 3)' 

Since x 2 + 2x + 3 will not factorize into real factors, partial fractions 
are given by 
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Then A = 1 by the ‘cover-up’ method, and the ‘comparison’ method 
leads to values B = — 1, C = 2 and 


fix) = x + 


1 

x-2 


x-2 

x 2 + 2x + 3' 


Hence 




«2i + 2) 


dx 


" x 2 + 2x + 3 T (x + l) 2 + 2j 
= \x 2 + log \x-2\-i log (x 2 + 2x + 3) 

3V2 t Jx+1 

+ ~ tan ("7 T 


6.6 INTEGRATION OF FUNCTIONS WITH 
IRRATIONAL DENOMINATOR 

Consider the integration of functions of the type 
ax + b 

->/ 2 , -;— v a, b, p, q real constants. 

V(x 2 +px + q)' 


6.6.1 Numerator unity 

The method of evaluating 


dx 


J VC* 2 + px + q) 

is very similar to that of section 6.4.1, that is, by completing the square 
of the quadratic x 2 + px + q and using one of the standard forms 
indicated by (29), (31) or (32). 

Example Evaluate 


'=ft 


VC* 2 - 4x + 13) 

Solution The quadratic x 2 — 4x + 13 = 9 + (x — 2) 2 and hence 

/«r_*_ 

Js V{9 + (X- 2) 2 } 

, [**-. (ir_ 2 )][ 

= sinh" 1 2 — sinh -1 1 

= log {2 + V(2 2 + 1)} - log (1 + V(l 2 + 1)} 


= log 


2 + V5 
1 + V2' 
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6.6.2 Numerator in linear form 

The method of evaluating 
f (ax+ b) 


J V (* 2 +px + q) 

is very similar to that of section 6.4.2 (a). 
Example Evaluate 

J_j (x-Ddx 


dx, a, b, p, q given, real constants, 


- 4, so that 


J V(5 -4x- x 2 ) 

Solution The derivative of 5 — 4x — x a is — 4 — 2x and the numerator 
of the integrand is written x — 2 = —1(—4 — 2x) - 
(-4 - 2x) 

-* a )“ 

-V(5 - 4* - x 2 ) - 4 sin' 1 {J(x + 2)}. 


If (~4 - 2x) C dx 

~2j ^(5-4x-x°) ax 4 JV( 9-(x + 2f} 


6.7 INTEGRATION BY PARTS 

The method of integration called integration by parts depends on the 
formula for the differentiation of a product, namely that 

d(uv) _ dv du 
dx ~ U dx 0 dx’ 


where u and v are differentiable functions of x. 

Suppose / and g are differentiable functions of x and that 

G(x) —jg(x) dx, then the formula for integration by parts is 

jfgdx^fG-jof'dx (51) 


The derivative of the right side of this equation is fg + f'G — Gf 
which is fg, the derivative of the left side and this proves (51). 

For this method to be of use it is necessary that the integral on 
the right of (51) should be easier to evaluate than the given integral. 
Note that of the two terms / and g the latter is integrated in each term 
on the right of (51). 


Example (i) Evaluate / = J 


xe 31 dx. 


Solution Both functions x and e 3 * have simple integrals. If x is chosen 
as the function to be integrated, the power of x increases and the 
resulting integral is no simpler. If x is chosen as the function to be 



116 


MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


differentiated the resulting integral is simpler than the original. Hence, 
choosing/ = a, g = e 31 , then G = \e Zx and 

/ = JH xe 3x dx 
= [*ie**£ - [ l |e 31 dx 

= [*Ae 3 * - e 3l /9^ 

= (2e 3 + l)/9. 

This method of integration involves the product of two functions 
and it is necessary to integrate one of the functions and differentiate 
the other. At first the choice of function to be integrated may appear 
difficult. Once one of the functions is chosen to be integrated then this 
function must continue to be integrated should it be necessary to 
perform further integration by parts. 

Example (ii) Evaluate I = je 2x sin 3x dx. 

Solution Choose g = e 2 * then / = sin 3x and 

7 = Je 21 sin 3x — J \e 2x (3 cos 3x) dx. 

Using formula (51) again for Je 21 cos 3 a dx and again choosing 
g = e 2x , leads to 

je 2 * cos 3a dx = Je 21 cos 3a — J £e 2l (—3 sin 3a) dx. 

Thus 7 = |e 21 sin 3a — fe 21 cos 3a — 97/4 

and transposing the term containing 7 on the right to the left gives 
137/4 = \t 2x sin 3a — Je 21 cos 3a, 


and 7 = e 2l (2 sin 3a — 3 cos 3a)/13 + an arbitrary constant. 

It is recommended that the reader evaluates this integral by choosing 
g = sin 3a and / = e 21 . 

Formula (51) is used to integrate functions whose derivative is known 
and whose integral is not of standard form, such as log a, tan -1 a etc. 


Example (iii) Evaluate I log a dx. 
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Solution The integrand is considered as the product of two functions, 
/ = log x and g = 1, Then 

Jlog xdx = x log x — Jx(l/x) dx 

= x log x — x + an arbitrary constant. 

Example (iv) Evaluate J tan" 1 x dx. 

Solution Let f = tan -1 x, g = 1, then 

J tan -1 x dx = jjr tan -1 xj — J {x/(l + x 2 )} dx 

= [x tan’ 1 x - i log (1 + x 2 )J 
= tan -1 1 — | log 2 
— \ log 2. 

6.8 REDUCTION FORMULAE 

The technique of integrating by parts repeatedly is often necessary to 
evaluate an integral such as 

/„ =Je*x»«&. (52) 

Integrating by parts gives 

I n = e r x n - J e z nx n ~ 1 dx (53) 

and the integral on the right is the same integral as the original of 
equation (52) with n — 1 in place of n. Hence it is written /„_! and 
integral (53) becomes 

/„ = e*x n - «/„_!. (54) 

This is the reduction formula for the given integral which may be used 
for any value of « > 0. For example, if n = 3, 7 3 = e T x 3 — 3/ 2 , / 2 is 
evaluated by writing n = 2 in formula (54) as / 2 = e*x 2 — 2 I u and 
similarly I x = e z x — 7 0 , 7 0 = e 1 so that finally 

7 3 = xV - 3xV + 6xe z - 6e x + C 
where C is an arbitrary constant. 

Simple results are often achieved if the integrals to be reduced are 
definite integrals. 

Example Obtain a reduction formula for 



n > 0 . 
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Solution The function sin n x may be written as the product of sin n_ 1 * 
and sin* so that integrating sinx and differentiating sin"' 1 * and 
integrating by parts, 

h = j^— cos x sin" -1 xj — J (—cos x)(w — 1) sin" -2 a cos x dx 

or ljr 

I n — {n — 1) j sin" -2 x cos 2 x dx. (55) 

Now } 

I n - 2 = sin" -2 x dx. 

Hence it is necessary to transform the integral in equation (55) using 
the identity cos 2 x + sin 2 x = 1, so that 

In = (« — 1) J sin"" 2 x(l — sin 2 a:) dx, 

or /» = («- 1 )/„_ 2 - (n - 1 )/„. 

Solving for /„ gives 

nl n — {n— l)/ n _ 2 
n - 1 


In ■■ 


-In-2 


(56) 


which is the required reduction formula. 

When n is a positive integer the formula may be applied repeatedly 
until the remaining integral is either I u when n is odd, or /„, when n 
is even, where 

fin 

h = sin x dx = 1 and J 0 = dx = \n. 

Writing n — 2 for n in formula (56) gives 


and writing n — 4 for n in formula (56) gives 


and so on, so that finally. 


n — 1 n — 3 n — 5 
n n -2/T^4‘ 


. I 0 or A. 


When n is odd, /„ 


(n - l)(n - 3)(n - 5). ■ ,4.2 
n(n — 2)(n — 4)... 5.3 
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„ (» - 1)(« - 3)(n - 5) . . . 3 . 1 *■ 

when n is even, /„ = „(n - 2)(n - 4)7. .4.2 ' 2 

6.9 FURTHER SUBSTITUTIONS 

Rule (c) of section 6.2.1 is a substitution rule, but more generally let 
F'(x) = f{x), then 

[F{<K X ))Y =fi<K x Mi x ) 

and J mx)W(x) dx =jf{<Kx)}d<Kx) = J/(«) du (57) 

where u = <f>(x) and after integration u is replaced by fix). For example, 
J {<K x )} n f'i x ) dx = ju n du where u = f(x), 

= n+ 1 

=;rri w *»”‘- 

Sometimes it is necessary to write x = g{u) say instead of u = <j>{x) in 
the given integral. Then the function g(w) is the inverse of <f>(x) and 
hence must be chosen to be monotonic in the range of integration. 
Consider the definite integral 

jj(x)dx = Fib)-F(a). 

Suppose that the substitution x = g(u) is made such that as x varies 
monotonically from a to b, u varies monotonically from a. to /? where 
a = g{a), b = gif), then 

£/(x) dx = F{gifi)} - F{gi*)}. 

Some general substitutions follow, where a and b are given constants: 


if \/(ax + b ) occurs, substitute ax + b = u 2 ; (58) 

if y/(a 2 — x 2 ) occurs, substitute x = a sin 0, —far sC 0 < (59) 

if a 2 + x 2 occurs, substitute x = a tan 9, —far < 9 < fn-; (60) 

if V(* 2 + ° 2 ) occurs, substitute x = a tan 6, — \n < 6 < (61) 

or x — a sinh u; (62) 

if V(x 2 — a 2 ) occurs, substitute x = a cosh u, u > 0. (63) 
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Example (i) Evaluate 


'-r 


dx 

{Vd + * 2 )} 3 ' 


Solution Substitute x = tan 6, then dx = sec 2 6 dd; 

when x = 0, 8 = 0; when x = 1, 6 = \n, and {\/(l + a : 2 )} 3 = sec 3 6. 
Then 


_ ri” sec 2 6 dd _ r*»- 

" Jo sec 3 0 ~ J 0 


Example (ii) Evaluate 




x 2 dx 

Vi* 2 + l) 


Solution The substitution x = tan d transforms the integral into 
Jtan 2 d sec 0 d6 which is no simpler than the original. Hence substitute 
x = sinh u, dx = cosh u du and 


/=Jsinh 2 «du 

= | J(cosh 2 u — 1) du 

= l sinh 2 u — \u 
= \ sinh u cosh u — \u 
= ix\/(x 2 + 1) - | sinh' 1 x. 


6.9.1 Integration using t = tan \x 

Rational functions of sin x and cos x may be reduced to rational 
functions of t by the substitution t = tan — in < \x < $tt. In 
particular this substitution is of use in integrating functions containing 
expressions a cos x + b sin x, where a and b are constants. The method 
uses the following results: 

1 - t 2 21 

cos x = j sin x = } (2 and i sec 2 \x dx = dt 


so that dx = 2dtl{\ + t 2 ). 

The substitution is cumbersome and is used only when the simpler 
methods of earlier sections have failed. 


Example Evaluate J - 


Solution Substitute t = tan \x and use the results above and the 
integral becomes 
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6.10 INFINITE INTEGRALS 

Infinite integrals are those in which the limits of integration or the 
integrand or both become infinite. In section 6.3 the integral of f(pc) 
was defined in the range a < x < b, on the assumptions that f(x) is 
defined at each point in the range and that a and b are finite. This 
definition is now extended to the case where a or b or both become 
infinite, and to the case where J\x) is undefined for a finite number of 
points in the range of integration. 


6.10.1 The limits of integration become infinite 

Let F'(x) = f(x) for all values of x > a, where a is a finite real number, 
then J /( x) dx is defined by 

(**/(*) dx = Lim f */(*) dx = Lim F ( x ) ~ F ( a ) (64) 

Ja X— oo J a X—oo 

when this limit exists. 

Similarly J f{x) dx is defined as 

Lim [ b f{x) dx = F(b) - Lim F(X) (65) 

X— -co JX X—-oo 

when the limit exists. 

If the integral as a limit exists the integral is said to be convergent. 
Otherwise it is divergent. 

Example (i) Evaluate J 

r x dx 

Solution Consider — = log X — log 1 

= log X -*■ oo as X -*■ oo. 

Hence the integral is divergent. 

Example ip) Evaluate J e~ kx dx, k a positive constant. 

Solution Using (64), 

J c~ kx dx = Lim J e~ kx dx 
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6.10.2 Substitution in infinite integrals 

It is possible to transform an integral with infinite limits into one with 
finite limits by a substitution, as in the following example. 

Example Evaluate J j ^ x 2 

C x dx 

Solution Consider I p-p and make the substitution x = tan 0 so 

that the integral becomes 

rtan~ 1 X 

dd = tan -1 X which ->• %n as A"-*- oo. 

Hence it is possible to evaluate the integral directly as 


f dx ft” 

1 l+* 8- Jo " 




6.10.3 The integrand becomes infinite 

Let the integrand f(x) become infinite at the end points of the interval 
a < x < b. Strictly the integral does not exist since the integrand does 
not exist at these points. But a meaning may be attached to the integral, 
called an improper integral. 

Let F'(x) = f(x) for a < x < b. If both F(a + 0)-*a finite limit 
and F(b — 0) -*■ a finite limit then 


F(b - 0) - 

r dx 


lj(x)dx isd 
F(a + 0). 


( 66 ) 


Example (i) Evaluate ^ ^ _ . 

Solution The integrand becomes infinite when x — 2. Consider 

= sin' 1 (1 - i»). 

As d —>• 0, sin -1 (l — \S) -> \tt and hence this is the value of the 


Example (ii) Evaluate J log x dx. 

Solution The integrand becomes infinite (and negative) at x = 0. 
Consider 


J log x dx = |jc log x — xj 


= d — 1 — <5 log d. 

Now — <5 log <5 = (log X)jX where X — \/6 and 

Lim (- d log d) = Lim {(log X)/X} = 
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The definition (66) may be extended to define integration of functions 
which are undefined at a finite number of points within the range. For 
if /(x) becomes infinite at x = c, a < c < b, then using (42) of section 
6.3.1 the integral may be written 

JT /(*) dx = £/(x) dx + £/(x) dx. 

If F(c — 0) and F(c + 0) both approach finite limits then the integrals 
on the right may be evaluated using (66). 

6.11 OTHER METHODS OF INTEGRATION 

The methods of integration of elementary functions given in this chapter 
are not exhaustive. There are many elementary functions which are not 
exactly integrable, such as cos(x 2 ), 1/V(1 — x 4 ), (sin x)/x, exp (—x 2 ), 
and other methods of integration do exist to deal with these. Numerical 
methods of integration are discussed in section 14.8 of chapter 14. The 
integrals of some important functions are tabulated. Further methods 
are beyond the scope of this book. 


EXERCISE 6 

In the following examples a and b are given constants. Find the in 
definite integrals of the functions 1-42: 

1 3a 2 x 8 , 2 (3x 2 + 4x + 2), 

3 (ax 3 + b)\ 4 V(2ax), 

5 V(e*), 6 [V(2 + x 2 ) - V(2 - * 2 )]/V(4 - x% 


7 coth 2 x, 

9 (5 — 4x)~ *, 

10 al(a - x), 

12 (x 2 + 1 )/(x - 1), 

14 sinh x/(l + cosh x), 

16 sec x cosec x, 

18 (1 — sin x)/(x + cos x), 

19 cos 3 x, 

21 sin 2 x cos 3 x, 

23 cos 5 x cosec 7 x, 

25 cosec 4 x, 

27 sin 3x cos 5x, 

29 (2x - l)/(x 2 + 4x), 

31 (2x - l)V(x 2 + 4x + 8), 
33 (x - 5)/V(4x - x 2 ), 

35 e*/(I + e 21 ), 

37 l/x(l - x 2 ), 

39 l/x(x + l) 2 , 

41 (x + l)/(x 2 + 4x - 5), 


8 2V, 

11 (3x + 2)/(3x + 1), 

13 1/(1 + 9x 2 ), 

15 x/V(x a + 1), 

17 (sin x — cos x)/(sin x + cosx), 

20 sin 5 x, 

22 sin 3 \x cos 5 \x, 

24 sin 2 x cos 4 x, 

26 tan 2 5x, 

28 (2x - l)/(x 2 + 4x + 4), 

30 (2x - l)/(x 2 + 4x + 8), 

32 1/V(1 - x - x 2 ), 

34 1/VM1 - x)}, 

36 l/(x 2 + 4x + 7), 

38 (5x - 3)/(x + l)(x - 3), 

40 1/(1 - x 4 ), 

42 (x 4 - x 3 - x - l)/(x 3 - x). 
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Evaluate the integrals 43-49: 

43 J tan* dx, 44 J sin 5 xcos xdx, 

4 '^ 4 ^ 

«r^*. 

Jo 1 + * 2 

Find the indefinite integrals of the functions 50-54 by integrating by 
parts: 

50 sin -1 x, 51 x 2 tan -1 x, 

52 xG Ila , 53 x sin x, 

54 x n logx, n^—l. 

Evaluate the integrals 55-58: 

55 x -2 log(l +x)dx, 56 J xcos xdx, 

57 J x 2 sinj <dx, 58 |" e -21 cos 3 xdx. 

59 Find a reduction formula for /„ in terms of / n _ x given that 


Hence find the value of / 3 . 


60 Prove that if /„ = tan" x dx, then 


/„ + In —2 = (tan" -1 x)l(n — 1), n # 1, 
and hence evaluate J tan 6 x dx. 

I Given that /„ = J e 1 sin* x dx prove that, for n > 1, 
(« 2 + l)/„ = n(n - 1 )/„_ 2 . 

Hence or otherwise evaluate / 4 and I s . 


-In-2 ~ 2 ’ «> 2 » 


and evaluate / 4 and / 5 . 
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63 Find the value of 
n# 5. 




cos 5x cos nx dx (i) when n = 5, (ii) when 


Find the indefinite integrals of the functions 64-68 using the substitu¬ 
tion given: 

64 lj{xV(x 2 - 2)}, (x = I//), 65 l/(e* + 1), (x = -log «), 

66 x(3x 2 - 2) 9 , (3x a -2 = 0, 67 x/V(* + 1), (V(* + 1) = u ), 

68 cos x/V(l + sin 2 x), (w = sin x). 


Examine for convergence, and in cas 
evaluate, the definite integrals 69-76: 


e 1 sin x dx, 


69 r. 

»r , 

73 1 VO-*)’ 

73 Jct 


where the integral converges 
dx 


70 f 

72 r 

74 I V(1 + rf)' 

-urn-- 


(X 2 + fl 2 ) 3 ’ 

dx 

(x + l)(x + 2)’ 
dx 


By writing t = tan \x find the indefinite integrals of the functions 
77-79: 


77 1/(3 + 5 cos x), 78 sin x/(l — sin x), 

79 (1 + tan x)/(l — tan x). 

80 Evaluate 

r i7r dx 
Jo 2 + sin x' 



7 

Applications of Integration 


7.0 INTRODUCTION 

In this chapter the problem of integration will be explained briefly 
without rigour, with reference to geometrical and physical applications. 

7.1 INTEGRATION AS THE LIMIT OF A SUM 

7.1.1 Area as a limit 

The problem is to consider the area A included between a continuous 
curve y = f(x), the x-axis between x — a and x = b, and the ordinates 
x = a and x = b as in Fig. 7.1. 



Fig. 7.1 
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The method is to construct a set of rectangles which lie underneath the 
curve whose total area is S ± and another set which lie above it whose 
total area is S 2 , and investigate as to whether they can be made to 
approach each other. 

Let y = fix) be continuous, increasing, and positive in a < x <b 
and let it be represented by the curve sketched in Fig. 7.1. Divide the 
range b — a into a series of subdivisions with abscissae a, a + h u 
a + h 2 , . . ., a + h n = b and complete the rectangles on bases h u 
h 2 — h ± , h 3 — h 2 , . . ., h n — h n _i. The areas and S 2 are given by 

51 =f(a)hi + f(a + hi)(h 2 - +f(a + h 2 )(h 3 - h 2 ) + . . . 

+f(a + h n .i)(h n - h n _i) 

5 2 = fia + hi)hi + f(a + h 2 )(h 2 - hi) + f(a + h 3 )(h 3 - h 2 ) + . . . 

+ f(a + h n )(h n - h n .i) 

then Si < A < S 2 . 

As the number of subdivisions is increased, and the largest of the 
intervals h k — x is made ‘as small as we please’, then the difference 
between Si and S 2 approaches zero, i.e. Si and S 2 tend to a common 
limit A. If, for all possible values of Si and S 2 , A is a fixed number, A is 
defined as the required area. The difference S 2 — S x is given by 

S a - Si = {f(a + h) -f{a))hi + {/(a + h 2 ) - f(a + hi)){h 2 - hi) 

+ . . . + {/(a + h n ) — f(a + h n -i))(hn - h„_i). 

Let h be the greatest of the numbers h ly (h 2 — hi), (h 3 — h 2 ), . . 
(fi n — h n - 1 ). Then since f(x) is increasing, the numbers {f(a + h^ — f(a)} 
{/(a + h 2 ) — f(a + hi)} etc. are all positive or zero. Hence 

- Si < {f(a + hi) —f(a)}h + {f(a + h 2 ) -f(a + hi)}h 
+ . . .+{f(a + h n ) —f(a + h n .i)}h 

= if(P + K) ~f(a))h 
= {fib) ~fia))h. 

Thus, given a number c say, however small, it is possible to choose h 
such that 


, c 

< fib) -fid) 

then S 2 — Si < c, 

i.e. by taking the subdivisions sufficiently small, the difference S 2 — Si 
can be made ‘as small as we please’, and the area A is a definite number. 
Let x k be a value of x in the kth sub-interval. Then if 


2 f(x k )(h k -h k -i)-+i 
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as the largest sub-interval (h k — 0 where / is fixed and finite, 

then /(x) is said to have a definite integral defined as 

Lim ]£ f(x k ) dx k , where h k — h k _ 1 = 8x k (1) 

x=a 

Note that Fig. 7.1 shows the case of n equal subdivisions of (b — a). 

7.1.2 Area as integral 

Suppose now that F\x) = f(x) also. Then, using the mean value 
theorem, 

F(a + h k ) - F(a + h k .,) = (h k - h k _ X )f(x k ) (2) 

since x k is a value of x in the &th sub-interval. Summing over all values 
of k, 

F(b) - F(a) = | (h k - h k _ ,)f{x k ). (3) 

If the sum on the right of equation (3) approaches a limit as the 
largest of the intervals h k — h k . t = 8x k approaches zero, then that 
limit is the same as in (1). When this limit exists it is the definite integral 
of the function /(x) between the limits x = a and x — b, i.e. 

Lim 2 fyx) 8x = area = J /(x) dx (4) 

the idea of area being omitted as necessary. 

7.1.3 Functions not necessarily continuous increasing and positive 

In section 7.1.1, y =f[x) was chosen to be continuous, increasing, and 
positive throughout the whole range a < x < b. Similar arrangements 
to those of sections 7.1.1 and 7.1.2 hold if the function decreases 
throughout the range, or if it is negative anywhere in the range. A 
function which decreases and increases through the range may be divided 
into intervals in which the function either decreases or increases, and 
the total area defined as the sum of the separate areas. 

A function which becomes negative in the range may be divided into 
intervals in which the function is either positive or negative, and the 
total area defined as the sum of the separate areas. Since the integral 
is negative when the function is negative the area here is the modulus 
of the integral (see section 6.3.2). 

Moreover, the argument applies to all increasing (or decreasing) 
functions, not only to those which are continuous. For example, it is 
possible to integrate the following function which is increasing but not 
continuous: 


fix) = i 

in 

0 < x < i. 

fix) = i 

in 

h<x<l 

fix) = i 

in 

l<x<h 

etc. 



fix) = 1 

when x = 1. 
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See Fig. 7.2 for a sketch of y — f(x). 

This function has an infinite number of discontinuities; nevertheless the 
definition of integral as the limit sum applies. 

An*) 


* 


"l i * ” 

Fig. 7.2 

7.1.4 Rules for definite integrals 

(1) jj(x)dx=- \j{x)dx 

since F(b) - F(a) = ~{F(a) - Fib)} 

(2) J f{x) dx = J fix) dx + f(x) dx 

where c is some value of x between a and b. 

7.2 APPLICATIONS OF INTEGRATION 
7.2.1 Plane areas 

The area enclosed by the curve y = fix), the x-axis from x = a to 
x = b and the ordinates at x = a, x = b is, from (4) of section 7.1.2 

L'm fix) dx = j[ fix) dx (5) 

If x = x(t), y — yit), a < t < f where t is a parameter, 

area = J yit) ^ dt (6) 

Suppose y x — ffx) and y 2 = / 2 (x) are two functions of x which are 
continuous for a < x < b and suppose / 2 (x) </i(x) for a ^ x ^ b. 



130 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 

Then the area bounded by the two curves and the ordinates at x = a, 
x = b is given by 

Lim X f {J\(x) - Mx)} dx = £ {/!(*) - f 2 (x)} dx (7) 
See Fig. 7.3. 

In polar coordinates the equation of a curve is r — f(6). Let P 0 be 
the point ( r 0 , 0 o ) and P x be the point (r 0 + dr, 9 0 + 66), see Fig. 7.4. 




Then the elementary area swept out by the radius OP lies between 
Jr§ 66 and J(r 0 + dr) 2 66, i.e. 
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and thus, in the limit dA/dd — \r 2 . Summing elementary areas, the area 
swept out by the radius OP as 6 increases from a to ft is 

Lim lr 2 66 = J*V 2 d6 (8) 

In terms of the corresponding Cartesian coordinates, x = r cos 0, 
y — r sin 6, tan 6 = y/x and sec 2 6 dd — (x dy — y dx)jx 2 . Also since 
r = x sec 0, r 2 dO = x dy — y dx and the area may be written 

j\xdy-ydx) (9) 

which is the area swept out by the radius as P moves from P to Q 
along the curve. 

Example (i) Find the area bounded by the parabola y — 2 — x 2 and 
the straight line y = x. 



Fig. 7.5 


The curves intersect when x = 1 and x = —2, and for all values of x 
between —2 and 1 the parabola is above the line by an amount y 1 — y 2 
where y 1 = 2 — x 2 and y 2 = x. Hence the total area A between the 
curves is given by 

A = Lim J (Ji ~ y*) A x — J (2 — x 2 — x) dx = 4J. 
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Example (ii) Find the area between the x-axis and the cycloid 
x = a(t — sin t),y — a{ 1 — cos t), 0 < t < hr, a sketch of which is 
given in Fig. 7.6. 

Cycloid x = o(t-sin t),y = o(l - cos () 



Solution Using (6) the required area A is given by 

— cos t)a(l — cos t) dt = 3tm 2 . 

Example (iii) Find the area A enclosed by the cardioid r = a(l + cos 0), 
-7T < 0 < n. 

Solution From (8), 

A = |* |a 2 (I + cos 0) 2 dd = hra 2 . 

7.2.2 Volumes 

The volumes of many solids can be found by summing elementary 
volumes whose cross-sectional area is known, and taking the limit of 
this sum as the thickness of the element tends to zero. Suppose for 
example, that the solid is bounded by two parallel planes perpendicular 



Fig. 7.7 
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to the x-axis at x — a and x = b. Let 3 V be the volume of a repre¬ 
sentative elementary volume, or ‘slice’, whose area of cross-section is 
A(x) and whose thickness is 8x, as sketched in Fig. 7.7. 

Then the required volume is the limit of the sum of such elementary 
volumes as the thickness dx ->■ 0. The total volume V is given by 

V = Lim ^A(x) dx = J A(x) dx. (10) 

In particular, when the solid is a solid of revolution formed by 
rotating a plane area about the x-axis or j-axis, a simple result may be 
obtained. Let the plane area be that bounded by the arc of the curve 
y — /(*), the x-axis and the ordinates x = a, x — b, and suppose that 
this area is rotated about the x-axis to form a solid of revolution. 



The representative element of volume is then a disc of radius y =/(x), 
and hence A(x) — ny 2 , (see Fig. 7.8). Then from (10) the volume of the 
solid V is given by 

V = Lim 2 rry 2 dx = J ny 2 dx or J* n{f(x)} 2 dx. (11) 

Example (i) Find the volume of the ellipsoid formed by rotating the 
area enclosed by the ellipse x 2 ja 2 + y 2 /b 2 = 1 , > 0 about the x-axis. 
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Solution y 2 = b 2 ( 1 — x 2 /a 2 ) —a < x < a 

Volume = 2 ^ nb 2 {\ - x 2 ja 2 ) dx = 47rab 2 /3. 

Writing b = a gives the volume of a sphere of radius a as 4 tto 3 /3. 

Example (ii) Find the volume of the ring generated when the circle 
x 2 + O' — b) 2 = a 2 , a < b, is rotated about the x-axis. 


Solution See Fig. 7.9(a). For each value of x between —a and a there 
are two values of y, say y x and y 2 where y 1 = b + \/{a 2 — x 2 ) and 
y 2 = b- V(a 2 - x 2 ). 



The area of the cross-section of the ring by a plane perpendicular to 
the x-axis at the point x is 

Try 2 - try 2 = ir^ + y s )(y 1 - y a ) = 2nb2y/(a 2 - x 2 ). 

See Fig. 7.9(b). Hence the required volume is 

J 4? rby/(a 2 — x 2 ) dx = 2n 2 a 2 b 

(substitute x = a sin 6 to evaluate the integral). 

It is of interest to note that the volume is the product of %ra 2 , the 
area of the generating circle, and 2nb which is the length of the path 
of the centre of that circle as it rotates through one revolution about 
Ox. This is investigated further in section 7.2.6. 

7.2.3 Centre of mass, centroid 

Let masses m u m 2 , m 3 , . . m n be concentrated at points P u P 2 , P 3 , 
. . ., P n at distances from a fixed plane x = 0 of x lt x 2 , x 3 , . . ., x n . 
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Their moment about the plane is defined to be 

+ x 2 m 2 + x 3 m 3 + . . . + x n m n = j? x k m k , (12) 

known sometimes as the first moment of the system about the plane 
x = 0. 

If the total mass m k = M say, then the centre of mass G of the 

system is the point at which a concentrated mass M would be 
situated in order to have the same moment about x = 0. Let G have 
coordinate x, then the position of G is given by 

xM or x 2 m k = j? x,jn k . (13) 

fc=i fc=i 

If the masses are located at points whose coordinates are (x lt y lt zf, 
(x 2 , y 2 , z 2 ),. . (x„, y n , z n ) with reference to fixed axes, it follows that 

G has coordinates (x, y, z ) given by 

Mx = j x k m k , My = £ y„m k , Mz = £ z k w k . (14) 

fc=i fc=i fc=i 

The centroid of the points P x , P 2 , P 3 , is the geometric point 

defined by equations (14) when the masses m k are taken equal so that 
the centroid or mean centre of the system of points is the point whose 
coordinates are 

* = n£ 1 Xk ’ * = \ *~nt£ 1 Zk 

In order to find the centre of mass of most physical objects it is 
necessary to make certain simplifying assumptions. One is that matter 
in a given solid is continuously distributed throughout the solid. 
Another is that if P is a point in the solid and dV is an element of 
volume containing P whose mass is dm, then the ratio dm/d V tends to 
a definite limit called the density of the solid at P, and denoted by p(P). 

7.2.4 Centre of mass of a plane lamina 

Consider the plane lamina formed by the area A enclosed between the 
curve y = fix), the x-axis from x — a to x = b and the ordinates 
x = a, x = b, see Fig. 7.10. 

Let the coordinates of the centre of mass of this area be (jc, y). The 
elementary area is y dx so that the elementary mass dm = py dx and 
its centre of mass is approximately the point G (x, $y). Taking moments 
about the x- and j-axes, 

x£ydx p = '%xy dx p 

y'lydx P = dx P . (16) 

If p is constant, mass is proportional to area and, in the limit (16) is 
equivalent to 

rb 

Ax = I yx dx 
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Ay = £V dx (17) 

i.e. (x, y) are the coordinates of the centroid of the area. 



Example (i) Find the coordinates of the centroid of a plane semi¬ 
circular lamina of radius a (Fig. 7.11). 



Solution This is the area under the curve 

y = V(« 2 — a 2 ), —a<, > < a 

Total area A = \na 2 . 

Let the coordinates of the centroid G be (a, y,), then x = 0 from 
symmetry, and 

4P = £ \fdx 

= j a j(a 2 -x*)dx 
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= j\a 2 -x 2 )dx = $a 3 . 



Example (ii) Find the coordinates of the centroid of the area contained 
between the x-axis and the cycloid x = a(t — sin t), y = a( 1 — cos /), 
0 < / < 2tt. 

Solution From example (ii) of section 7.2.1 the area A enclosed is 3na 2 . 
Let the centroid G have coordinates (x,y), then from symmetry x = a, 
or alternatively x may be found from 

f2j T 

Ax = a(t — sin /)a(l — cos t)a(l — cos t ) dt 
since dx = a( 1 — cos t) dt. 

Also, Ay = |* ja 2 ( 1 — cos t) 2 a( 1 — cos t) dt 

= £« 3 J (f — 4 cos t + f cos 2 1 + sin 2 1 cos /) dt 

= t — 4 sin t + f sin It + J sin 3 rj 
= fwa 3 . 

Hence y = §a and G is (a, la). 

7.2.5 The centroid of the volume of a solid of revolution 
Let the solid be generated by rotating the plane area under the curve 
y — fix) from x = a to x = b about the x-axis. Using (11) of section 
7.2.2 the total volume V is given by 

V = J ny a dx. 

The elementary volume is iry 2 dx and its centroid is at (x, 0) approxi¬ 
mately. Hence y = 0 and taking moments about the _y-axis, summing 
and proceeding to the limits, 

Vx = j\y 2 xdx. (18) 

Example (i) Find the coordinates of the centroid of a solid hemisphere 
with radius a. 

Solution Let the hemisphere be formed by rotating that part of the area 
of the circle bounded by x 2 + y 2 — a 2 for which x > 0, y ^ 0 about 
the x-axis (see Fig. 7.12). 

Let the coordinates of the centroid G be (x, y), then y = 0 and 
V = fira 3 . From equation (18), 

Vx = r°7 rfxdx 
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Example (ii) Find the coordinates of the centre of mass of a right 
circular cone of height h the density of which varies as the square of 
the distance along the axis from the vertex. 
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Solution Choose the vertex of the cone as the origin of the coordinates 
and the axis of the cone as the x-axis. Let the cone be generated by 
rotating about the x-axis the area enclosed by y = mx, 0 < x < h the 
x-axis between x = 0 and x = h and the ordinate x = h. Let the 
density be kx 2 where k is a constant. Let the coordinates of the centre 
of mass G be (x, y) then y = 0 by symmetry, see Fig. 7.13. 

Total mass = limit of ^iry 2 8x kx 2 


= k-n J m 2 > 


= \k-nm 2 h 5 . 


Taking moments about Oy, 

\kiTm 2 h s x = |* (Try 2 dx) (kx 2 )x 
— kn j m 2 x 5 dx 


= \k-nm 2 h 6 

and x = f h and G is (ih, 0). 

The reader is left to check that if the density is constant, so that mass 
is proportional to volume, then G is the point (f h, 0). 



Fig. 7.14 


7.2.6 Pappus’ theorem 

When a plane area A, as in Fig. 7.14, is rotated about an axis in its 
plane which does not intersect the area, there is a relation between the 
centroid of the area and the volume of the solid of revolution formed 
by rotating the area about the axis. This relation is a formula in the 
form of the theorem below, of use in calculating the volume when the 
centroid is known, or in calculating the centroid when the volume is 
known. 
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For, let the axis of revolution be the x-axis, and SA an element of area 
distant y from the x-axis. If V is the volume obtained by rotating A 
about Ox, 8 V the corresponding element of volume obtained by 
rotating 6A about Ox, and y the coordinate of the centroid of A, then 
8V^2ny8A. 

Hence V = Lim ^2 ny 8A 

— 2ir Lim ^y 8A 

= 2 it (moment of area about Ox) 

= 2 ,n(Ay) 

or V = 2 njfA, (19) 

and Pappus' theorem states that if a plane closed area is completely 
rotated about an axis in its own plane which does not intersect the area, 
the volume of the solid formed is equal to the product of the area and 
the length of the path traced out by the centroid of this area. Note 
that the axis may touch the area. 

Example Find the centroid of a plane semicircular lamina of radius a. 

Solution The semicircular lamina, as in Fig. 7.11, is chosen to be that 
bounded by the curve x 2 + y 2 — a 2 , y > 0 and the x-axis, — a < x < a. 
Hence A = \na 2 , V = W/3, 



= Ana 3 /in 2 a 2 
4 a 
Jn- 

Compare this method with that of example (i) of section 7.2.4. 

7.2.7 Moments of inertia 

Let masses m x , m 2 , m 3 , . . ., m, be concentrated at points P x , P 2 , P 3 , 
. . ., P n whose distances from a fixed line / are r u r 2 , r 3 , . . ., r n . Then 
the second moment or moment of inertia of the system about / is defined 
as 

/, = ms{ + m 2 r 2 + m 3 r 2 + . . . + m n r 2 = | m t rf (20) 

Let M be the total mass of the system, then the equation 

Mk 2 = /, (21) 

defines k h the radius of gyration of the system about /. 

If instead of a system of discrete masses there is a continuous distri¬ 
bution of mass, the assumption is made that the mass may be divided 
into small elements of mass 8m such that, if r represents the distance 
of one point of the element 8m from an axis, then all points of that 
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element will be within distances r ± c of the axis where c ->■ 0 as the 
largest value of dm -> 0. Then for each element of mass, its moment 
of inertia lies between (/• — c) 2 dm and (r + c) 2 dm and hence differs 
from r 2 dm by a quantity which is ‘as small as we please’ as dm -> 0, i.e. 

Lim ^r 2 dm = jr 2 dm = I. (22) 

Example (i) Find the moment of inertia of a thin u ni form rod of mass 
M and length / about an axis which is the perpendicular bisector of 
the rod. 

Solution Choose the origin as the mid-point of the rod and the x-axis 
along the rod. Then (22) gives 

/= x 2 ydx 

J-ii 1 
_ ME 
~ 12 ' 



Example (ii) Find the moment of inertia of a uniform disc of mass M, 
radius a about an axis through its centre perpendicular to the plane of 
the disc. 

Solution Consider the disc split into concentric circles so that at 
distance r from the centre the width of an element of area is dr, see 
Fig. 7.15. 
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Then dm — 2-nr dr(M/ira 2 ) = (2M/a 2 )r dr and 

1 = r 3 dr = \Ma 2 . 

Example (iii) Find the moment of inertia of a solid homogeneous 
sphere of total mass M and radius a about a diameter. 

Solution Choose axes Ox, Oy, Oz through the centre of the sphere. 
Let the sphere be cut into elementary discs by planes perpendicular to 
the z-axis at intervals dz as in Fig. 7.16. 



The disc between z and z + dz is approximately a cylinder of radius 
V(a 2 — z 2 ), thickness dz and mass dm = ^^5 7r(a 2 — z 2 )<5z, and its 
moment of inertia about Oz is ^ (a 2 — z 2 )dz \{a 2 — z 2 ), (applying 

the result of example (ii) above). Summing the moments of inertia for 
the separate discs and letting dz -*-0 

= iMa 2 . 


The moment of inertia of a rigid body about a line is important when 
the rotation of the body is being considered. Moments of inertia of only 
the simplest bodies can be found by integration. There are two theorems 
which may be used to calculate moments of inertia. 
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The parallel axis theorem relates the moment of inertia of a body 
about an axis through its centre of mass G to the moment of inertia 
about a parallel axis / distant d from G (see Fig. 7.17). 

Let I a be the moment of inertia of a body of mass M about an axis Gz 
through the centre of mass G, and let I t be the moment of inertia about 



a line / parallel to Gz so that / is distant d from Gz. Let m r at P(pc„ y„ z T ) 
be an element of mass of the body. Then, from formula (20), 

I, = Lim 2 ™ r 0? + (d~ x r ) 2 } 

= Lim [2 m r {x 2 + y 2 ) - 2d% m r x r + d 2 Jm r ], 

i.e. I, = I a + d 2 M. (23) 


The term 2»J r x r is zero since G is the centre of mass. 

For example, using the result of example (i) of section 7.2.7 and the 
parallel axis theorem, the moment of inertia of a thin uniform rod of 
length / about an axis through one end perpendicular to the rod is 
Ml 2 /12 + (|0 2 M which is \Ml 2 . 
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A second theorem, for plane laminas only, which is sometimes called 
the perpendicular axis theorem , states that the moment of inertia of a 
plane lamina about an axis perpendicular to its plane is equal to the 
sum of the moments of inertia about any two perpendicular axes in 
the plane which intersect on the first axis. For suppose the plane of the 
lamina is chosen as the xy-plane. Two perpendicular axes in the plane 
are Ox and Oy which intersect on Oz. If I u / 2 , and / 3 are the moments 
of inertia about Ox, Oy, and Oz and m r is an element of mass at 
(x r , y r , 0), then 

/ 3 = Lim ^m r (xf + yf) 

= Lim [^m T x 2 + ]£m r y r 2 ] 

or I 3 = r 2 + I x . (24) 

For example, the moment of inertia of a thin disc of radius a about 
a diameter may be found by equation (24). Choose axes Ox, Oy, and 
Oz through the centre of the disc such that Ox and Oy lie in the plane 
of the disc and Oz is perpendicular to the plane. From symmetry, 
h = h, and from example(ii) above/ 3 = \Ma 2 . Hence I x = I 2 = \Ma 2 , 
which is the required moment of inertia. 

Moments are also of importance in statistics. The first moment, 
defined by (12) of section 7.2.3, is used in computing the mean, and the 
second moment, defined by (20) of section 7.2.7, is used in computing 
the variance of a given set of data. These will be met with again in 
Chapter 15. 

EXERCISE 7 

1 Calculate the area bounded by the two parabolas y = x 2 , y = ix 2 
and the line y = 2x. 

2 Find the total area contained in the astroid x — a cos 3 t,y = b sin 3 1 , 
0 < t < In. 

3 Find the area bounded by the cardioid x = a{2 cos t — cos 2 1), 
y = a(2 sin t — sin 2 1), 0 < t < 2rr. 

4 Find the total area inside the circle r = 2a sin 0. 

5 Find the volume of the solid generated by rotating about the x-axis 
that part of the parabola y — 2x — x 2 which lies above the x-axis. 

6 Prove that the volume of a spherical cap of height h cut from a 
sphere of radius a is inh 2 (3a — h). 

7 The centres of two spheres of radius 3 cm and 4 cm respectively are 
at a distance 5 cm apart. Prove that the volume common to both 
spheres is 92n/\5 cm 3 . 

8 Sketch the curve y = x cos x for 0 < x < and find the volume 
generated when the area contained between this curve and the 
x-axis is rotated about the x-axis. 

9 Find the area of the region enclosed by the curves x 2 = ay, ay 2 = x 3 
and find also the volume generated when this region is rotated about 
the x-axis. 
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10 Show that the curve x(x 2 + y 2 ) = y 2 lies entirely in the region 
0 < x < 1, and make a rough sketch of the curve. Find the area 
of the region in the first quadrant lying between the curve and the 
line x = \ and find the x coordinate of the centroid of this region. 
(Hint: Use the substitution x — sin 2 0 to evaluate the integral.) 

11 Find the area bounded by one loop of the curve y 2 = x 2 (4 — x 2 ) 
and the coordinates of the centroid of this area. Hence, or otherwise 
find the volume generated when this area is rotated about the 
y-axis. 

12 Use the substitution x = 2 sin 2 6 to evaluate J x n (2 — x)*dx 

where n is a positive integer. Find the area enclosed by the loop of 
the curve y 2 = x\2 — x) and the coordinates of the centroid of 
that part of this area which lies in the first quadrant. 

13 Sketch the curve x = cos 3 t, y = sin t, 0 < t < 2tt. Find the area 
enclosed by the curve and the volume of the solid formed when the 
upper half of the area is rotated about the x-axis. 

14 Find the points of intersection of the curves r = a{ 1 + cos 9) and 
r — 3a(l — cos 0) and show that the area enclosed between these 
curves is (lUr/2 — 9\/3)a 2 . 

15 Find the centroid of the area bounded by the lines y = 0, x = at 2 
and that part of the parabola y 2 = 4ax from the origin to the 
point {at 2 , 2 at), where a and t are both positive. Find the volumes 
generated when this area is rotated (i) about the x-axis, (ii) about 
the y-axis. Find the value of t such that these two volumes are equal. 

16 The area bounded by the parabola y = x 2 and the line y = 4 is 
rotated about the y-axis to generate a solid. Find the moment of 
inertia of this solid about the y-axis, if the total mass of the solid 
is M. 

17 Find the centroid of the sector of a circle of radius a which subtends 
an angle 2a at the centre. Hence, or otherwise find the volume of 
the solid generated when the sector is rotated through one revolution 
about a bounding radius. 

18 Find the moment of inertia of a homogeneous solid sphere of mass 
M and radius a about an axis which is a tangent to the sphere. 

19 A right circular cone has a base radius a and altitude 2 a and mass 
M. Find the moment of inertia of the volume of the cone about its 
axis. 

20 Find the coordinates of the centroid of the area in the first quadrant 
bounded by two concentric circles and the coordinate axes, if the 
circles have radii a and b, b> a> 0, and the origin is their 
common centre. Find also the limiting value of these coordinates as 
a approaches b and discuss the meaning of this result. 

21 The area bounded by the curve y 2 = 4ax, the line x = a and the 
x-axis is rotated (i) about the x-axis, (ii) about the line x = a, 
(iii) about the y-axis. Find the volume generated in each case. 
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Matrices and Determinants 


8.0 INTRODUCTION 

The linear algebraic equation of the type 

= 3 *i + lx 2 + 6*3 

is of common occurrence in engineering. For example, the x’s could 
be the electrical currents flowing in the branches of a resistive network 
as in Fig. 8.1 and the the voltage drop across the network. 



Again, y x could be the weekly output of a factory which uses three 
different kinds of machine making the same product; the x’s are the 
quantities of each machine installed and the weekly outputs of the 
machines are 3, 7, and 6 respectively. 

It is usual to have to consider a set of two or more such linear alge¬ 
braic equations and it is straightforward to handle up to three such 
equations with integer coefficients without using other than simple 
algebra. However, if the coefficients are decimal or if there are many 
more than three equations it becomes necessary to employ a systematic 
algebraic process which will permit the important features of the set 
of equations to be examined and manipulated, and the information 
contained in the set to be stored for reference without necessarily having 
to solve the equations. 

The most important item of information contained within a set of 
equations is that provided by the coefficients of the algebraic terms. 
For example, the pair of equations 

y x = 3 *! + 7 * 2 + 6*3 
y 2 = 5*! — 2*2 + 3*3 




MATRICES AND DETERMINANTS 


147 


may be used to calculate thej’s from given values of the x’s. No matter 
how the x’s are changed the /s are immediately determinable and are 
controlled by the coefficients of the right side, that is by the values of 
the numbers in the ordered array 



Any rectangular array which stores definite information is called a 
matrix. A matrix has no numerical value, it is simply an assembly of 
terms which as a carrier of information may be manipulated according 
to certain algebraic rules. It is necessary in any such manipulation to 
know the exact nature of the information, for example the matrix 



becomes meaningful only when the ‘key’ is given as in 
W L D 
Newcastle United 6 2 5 

Norwich City 7 5 1 

Newtown Wanderers 12 0 1 


8.1 DEFINITION AND NOTATION 

A matrix A of m rows and n columns is said to be of order m X n. 
The components of the matrix are called elements. A double suffix 
notation is used, the element of the ith row and the jth column of the 
matrix A being written a if . The array is usually enclosed in curved 
brackets ( ). Other notations in common use are square brackets [ ] 
and double bars 11 11. When there is no ambiguity or doubt the matrix 

may be written A, (a,,), or simply a fj . Thus 


A = (flu) — a u — I 


012 013 

022 023 

I 031 032 033 




aj 


( 1 ) 


An important matrix is the square matrix of order n X n, i.e. one 
of n rows and n columns. Such a matrix occurs for instance when 
dealing with a linear transformation from one set of space variables to 
another. In the two-dimensional Fig. 8.2 the point P has coordinates 
(a, b ) when referred to the rectangular Cartesian system Oxy, and 
coordinates (A, B ) when referred to the system OXY. 
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The transformation is given by 

A = a cos 9 + b sin 9 \ 

B = —a sin 9 + b cos 9 / 

and the transformation matrix is the square matrix of order 2 

/cos 9 sin 9\ 

\—sinfl cos 9/ 


( 2 ) 


( 3 ) 



The matrix (3) can also be interpreted as representing a rotation of the 
coordinate axes Oxy through an angle 9 to the position of OXY. 
Alternatively if the coordinate frame Oxy is fixed as in Fig. 8.3 the 
matrix (3) can be interpreted as representing the rotation of the line 
OP, where P has coordinates (a, b), through an angle 9 to take up the 
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position OQ, where Q has coordinates (A, B). Changing the sense of 
rotation would then mean that the matrix 
/cos 0 —sin 0 \ 

\sin 0 cos 0 J W 

can represent the rotation of the line OQ in the plane of Oxy through 
an angle 0 in the anticlockwise sense. 

A matrix with m = 1 

(an a 12 ^13 • • • a ln ) 

is called a row matrix of order n, and a matrix with n = 1 



is called a column matrix of order m. 

A general linear transformation of m variables y t , i = 1,2, 3. m 

and n variables x } ,j = 1, 2, 3, . . n has the form 

}>1 = <*11*1 + <*12*2 + <*13X3 + • • • + <*ln*n \ 

y a = Oai*i + a a2 x 2 + a a3 x 3 + • • • + a 2n x„ I ^ 

y m = a ml X! + a m3 x 2 + a m3 x 3 + . . . + a mn x n J 

In (5) the variables y { and x, may be represented by two column 
matrices 



and the transformation matrix by A — (a tJ ). The equations may be 
written in compact notation in the form 

y, = 2 a u x i * — 1,2,3,. . m 
or simply y t = a ti x, ( 6 ) 

where the repeated index j indicates a summation from j = 1 to j = n. 
It will be seen in section 8.2.6 that relation ( 6 ) is simply a matrix 
multiplication and that (5) can be written in matrix form 
y = Ax. 
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If i,j take only the values 1, 2, 3 then equation ( 6 ) can be considered 
as a relationship connecting together the pair of rectangular Cartesian 
coordinates (Xj, x 2 , x 3 ) and (yuy 2 ,y 3 ). Suppose P is the point 
(*i, * 2 , x 3 ) and Q is the point (yi,y 2 ,y 3 ) such that OP = OQ, then 
OQ can be obtained by rotating OP through the angle POQ in the 
plane of OP and OQ. The matrix a u of equation ( 6 ) can then be 
thought of as representing this rotation and must carry information 
concerning both the magnitude of the angle and the location of the 
plane in which it lies. This representation of a matrix is investigated 
further in section 9.3.7. 

8.2 MATRIX ALGEBRA 

Consider now the basic algebraical manipulations of addition, sub¬ 
traction, and division as applied to matrix arrays. 

8.2.1 Equality of matrices 

Two matrices are equal if and only if they are of the same order and 
the corresponding elements are equal. 

8.2.2 Addition 

Consider the linear transformation 

yi = «uXi "i" 0ia*2i 

y 2 — a 2 1*1 + # 22*3 f (7) 

y 3 = a 3 i*i + a 32 x 2 ) 
having the transformation matrix 



and a second transformation 

Z 1 ~ * 11*1 + * 12 * 2 ) 

z 2 = £> 21 *! + b 22 x 2 1 (9) 

Z 3 = * 31*1 + *32*2) 

having the transformation matrix 

l b - M 

B= *21 *22. ( 10 ) 

\*3i bj 

If new variables ht, w 2 , and are constructed by adding the respective 
y and z variables, then equations 

WT = Ti + z i = (011 + *n)*i + (a 12 + *12^*2) 

w 2 = y 2 + z 2 = (a 21 + *2i)*i + (0 22 + b 22 )x 2 

tv 3 = }’ 3 + z 3 = (a 31 + b 31 )x 1 + (a 32 -f *3 2 )* 2 J 


01) 
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form a linear transformation having a matrix 

( «n + b lx a 12 + A 12 \ 

tfai + b 2 i a 2 2 + b 22 J. (12) 

a 31 + bgi a 3 2 + b 32/ 

This process may be extended and leads to the rule of matrix addition. 

Two matrices A and B, each of order m x n, when added together 
form another matrix of order m x n whose elements are the sums of 
the associated elements of A and B, i.e. 

A + B = (a tj ) + (by) = (a ti + by) (13) 

Note 1 No meaning can be attached to the addition of matrices of 
different orders. 

Note 2 The result (13) may be extended to any finite number of matrices, 
i.e. 

A + B + C = (Oy) + (by) + (Cy) = (fly + by + Cy) (14) 
and so on. 

Note 3 The commutative law and the associative law are true, i.e. 

A + B = B + A and (A + B) + C = A + (B + C). 
Example 



8.2.3 Multiplication by a scalar 
Suppose new variables 

Pi = 2 ji, p 2 = ly 2 , and p 3 = Xy 3 
are defined, where X is a numerical magnitude usually called a scalar, 
and used with equations (7), then 

Pi = Xa xl Xi + Xa 12 x 2 \ 

p 2 = Xa 21 x 1 + Xa 22 x 2 1 (15) 

p 3 = Xa 31 x i + Xa 32 x 2 j 

is a linear transformation with a matrix (Xa u ). This suggests the 
following rule. 

Scalar multiplication 

If a matrix is multiplied by a scalar X then every element is multiplied 
by X, i.e. 


XA = X(a u ) = (Xa tj ). 
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Sample® l(J \ J) = (,‘ ’ „)• 

Example (ii) -l(J “ 5 ) = _*)• 

Conversely, a matrix has a factor X if each element contains the factor X. 

8.2.4 Linear combination of matrices 

The rules of sections 8.2.2 and 8.2.3 lead to the combination rule in 
the form 

XA + [iB + vC = (Xa tj + fibij + vc tJ ) (16) 

where X, p, and v are scalar quantities. In particular if v = 0, X = 1, 
and fi = — 1 , then 

A - B = (a (J - b^. (17) 

8.2.5 Zero matrix 

The subtraction of two equal matrices of order m x n gives a zero 
matrix of order m x rt, that is one whose every element is zero. 
Example 



_ /18 13 8\ /18 13 8\ 

l 8 13 —12/ [ 8 13 -12/ 


/° 0 0 \ 

\0 0 0 y 

8.2.6 Multiplication of matrices 

Consider the successive linear transformations 

*1 = * 11*1 + * 12*2 
*2 = * 21*1 + * 22*2 


Mt a 


y x — a xl Xx + a 12 x 2 and 
y 2 = & 2 iXi + a 22 x 2 
y 3 = a 31 x x + a 32 x 2 
with coefficient matrices 

Mi fli 2 \ 

^ — j a 2X a 22 1 and 

W aj 


respectively. 
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They are equivalent to a single linear transformation 

yi = (a 1 ib 11 + a 12 b 21 )z 1 + (a^b^ + a 12 b 22 )z 2 
y 2 = (a 21 b lx + a 22 b 21 )z 1 + (a 21 b 12 + a 22 b 22 )z 2 
y 3 — (& 3 ibn + a 32 b 21 )z x + (a 31 b 12 + a 32 b 22 )z 2 
with a coefficient matrix 

( a nbn + u 12 b 21 a nbi 2 + o l2 b 22 \ 
a 2lbll + #22^21 a 2lb\ 2 + u 22 b 22 1 . 

a 3 Ai + a 32 b 21 a 31 b 12 + a 32 b 22 ) 

It is seen that the elements of C are obtained from those of A and B 
by summing the products of the elements of the rows of A with those 
of the columns of B. For example, the element of the third row and 
the first column of C is obtained from the third row of A and the first 
column of B. 


Matrix multiplication 

Given a matrix A of order m x n and a second matrix B of order 
n x p, then the product AB is a third matrix C of order m x p, such 
that the element of the ith row and the &th column of C is the sum of 
the products of the corresponding elements of the ith row of A and 
the kth column of B. In a more concise notation, if A = (a tJ ) and 
B = (b jk ) where i = 1,2,3,. , ., m;j = 1,2,3,. . n; and k — 1,2, 3, 
. . ., p, then C = (c ( fc) where 

c ik = a {j b jk (summed over j) (18) 

= a n bi k + a l2 b 2k + a i3 b 3k + . . . + a in b nk . 

Note that for the product AB to exist it is essential that there are the 
same number of columns in A as there are rows in B. 


Example (i) 



10+12 + 3 
-5 + 8 + 12 


(2 X 3) (3 X 2) 


6 + ° + p 7\ 

-3 + 0 + 4/ \15 l) 

(2 X 2) 


Example (ii) 


(; •)(-:: H 


15 + 6 5 + 12\ H 21 17\ 

12 + 0 4+ 0 = 8 12 4) 

9 + 2 3+4/ \5 11 7/ 

(3 X 3) 
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Example (iii) 

/ 2 -1 \ 

13 0 \ 

(4 5 6 -1 0 2) ° 

1-2 1 / 

\ 4 1/ 

(1 X 6) (6 X 2) 

= (8 + 15 + 0- 5 + 0 + 8 
= (26 -29) 

(l x 2 ) 


—4 + 0 — 24 — 3 + 0 + 2) 


Example (iv) 

12 4\ 

1 3 1 1(5 6 7) has no meaning. 

(2X2) (1X3) 

From the definition of matrix multiplication it is clear that if + is of 
order m X n then AB and BA can both exist if and only if B is of order 
n x m. The product AB is then of order m X m and the product BA 
is of order n x n. These two products are of different order if m # n, 
and it may be concluded that the product of A and B is non-commuta- 
tive, i.e. AB # BA. This is illustrated in the first two examples above. 
Even when n = m and the two products are of the same order they are 
generally unequal. 

Example (v) 



A particular case of the equality AB = BA is discussed in section 8.2.9. 

It is possible to multiply AB by a further matrix C providing the 
matrix order is correct. Given three matrices A = (a u ), B = (b jk ), and 
C = ( c kl ) of orders m x n,n X p, and p X q respectively the product 
AB, of order m x p, is given by 

AB = ( a ff b fk ), 

summed over j — 1, 2, 3, • . ., n. The product of AB and C, i.e. (AB)C, 
of order m x q, is given by 

( AB)C = (a lt b jk c k l) 

in which each element is a double summation taken over j = 1 , 2 , 3, 
• • •» n and k = 1, 2, 3, . . ., p. The order of the summation is imma¬ 
terial and the same result would be obtained by forming the product 



MATRICES AND DETERMINANTS 155 

A(BC). The product of A, B, and C is associative. It should be clear 
that the product is also distributive, i.e. 

(R + S)C = RC + SC 
where R and S are each of order r x p. 

Example (vi) If A and B are arbitrary square matrices of equal order 
then 

(A + Bf = (A + B)(A + B) = AA + AB + BA + BB. 
Example (vii) If A is any square matrix then 

AAA = A A 2 = A 2 A = A 3 

where A 2 means AA. 

Example (viii) Verify the associative law of multiplication of matrices 
by forming the product ABC in two different ways, where 



Example (ix) The general linear transformation equations (5) of section 
8.1 may be written 



or simply y - Ax. 

Example (x) Solve for x and y given 

f! -30-P- 
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Solution Multiply the left side 

[2\x + 6y\ _ /24\ 

\ 2x - 5 y) ~ ll9^ 

and hence, by section 8 . 2 . 1 , it follows that 
2\x + 6y = 24 
2x-5y= 19. 

The pair of equations has the solution x = 2, y = —3. 

Example (xi) Find the points gained by each team shown in the foot¬ 
ball league table in the section 8.0, on the basis of 3 points for a win 
(IV), 1 point for a draw (£>), and 0 points for a lose (L). 

Solution The points awarded are as follows: 

( Newcastle United \ / 6 2 5\ /3\ 

Norwich City 1 = I 7 5 11 (0 J 

Newtown Wanderers/ \12 0 l/ \l/ 



8.2.7 The unit matrix 

Suppose an arbitrary rectangular matrix A = (a tj ) of order m x n is 
unchanged after multiplication by a square matrix B = (b jk ) of order 
n x 72, so that AB = A. The elements b jk must then satisfy the equation 
a tk = a u b jk , 

(summed over j = 1, 2, 3, . . ., n) with i = 1, 2, 3, . . ., m and 
k = 1, 2, 3, . . ., n. This means that the b jk must satisfy a set of 
equations 

Oil = Onbii + a 12b 2\ + ■ ■ ■ + OirBnl 

«12 = 011*12 + 012*22 4- . . . + a ln b n2 

013 = 011*13 + 012*23 + ■ ■ • + 0m*n3 

etc. 

Since the an, a i2 ■ ■ . are arbitrary, the set of equations leads to 
bn = b 22 = b 33 = . . .=*„„= 1 


with all other b jk = 0 . 
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The matrix B has a similarity to unity in scalar algebra and is called 
the unit matrix of order n. It is denoted by /„. For example, 



Given the matrix A of order m x n it is easily shown that not only is 
AI„ = A, but also I m A = A, and for this reason the unit matrix is 
sometimes called the identity matrix. 

When the order is either understood or immaterial the suffix may be 
dropped and the unit matrix written I. 



Example (i) Evaluate I A and AI when A — 
Solution 



Example (ii) If 

12 1 1 \ 

A = 1 2 1 show that A 2 - 5A + 4/ = 0. 

\l 1 2/ 


Solution 



/4 0 
47= (0 4 
\0 0 


and A 2 - 5A + 41 = 0. 
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A matrix of the form XI where A is a scalar is called a scalar matrix. 
The latter is itself a particular case of a matrix with non-zero diagonal 
terms and all other terms zero. A matrix such as 



is called a diagonal matrix. 

8.2.8 The transposed matrix 

Given an m X n matrix A = (a tj ), the transpose of A is written A' or 
A T and is that matrix of order n x m obtained from A by interchanging 
the rows and columns. For example, 

(6 2 4 A 12 

'‘"l? 3 2 8 ) and A = A “U 2 I’ 

8.2.9 The inverse matrix 

The next algebraic operation is that corresponding to division. Multi¬ 
plication and division are inverse processes and it is to be expected that 
‘matrix division’ will be dependent on the multiplication process. In 
scalar algebra division by 3, for example, can be considered as the 
multiplication by the reciprocal of 3. The latter is that number which 
when multiplied by 3 gives unity. In matrix algebra, given a matrix A 
the reciprocal or inverse matrix of A is defined as the matrix B such that 
AB = BA = I. Such a statement requires, of necessity, that both A and 
B are square matrices. When A is square and B exists it will be denoted 
by A~\ i.e. 

AA- 1 = A' 1 A = / (19) 

Note 1 It is possible that A~ x does not exist. This possibility is discussed 
in section 8.5. 

Note 2 The condition that the matrices A and B commute in a general 
manner, i.e. that AB = BA I, cannot be dealt with here. It may 
be shown that such matrices can be found by considering the products 
of 
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Example (i) Verify that 


( 


1 

-2 

0 


- 2 \ /3 

41 is the inverse of 12 

l) \2 


Solution 



I 1 1 “ 2 \/ 3 

and 1-2 -1 4 2 

\ 0 -1 1 / \2 




The fuller understanding and the evaluation of an inverse matrix is 
dependent on the use of determinants to be defined and discussed in the 
next section. However, one application of the inverse matrix is illus¬ 
trated in the following example. 

Example (ii) Solve the system of equations 

3*i + x a + 2 x 3 = 5 

2x 1 + x 2 = — 3 
2xx + x 2 + * 3 = 3. 

Solution In matrix form, the system of equations is written Ax = B 
where 


1 3 1 2 \ M / 5 \ 

A =12 1 0, *= x 2 ), and B = (—31. 

\2 1 1/ W \ 3/ 

Pre-multiply by the inverse matrix A~ l , then 
A' 1 Ax = A~ l B 
or Ix = A~ 1 B 

or x = A~ 1 B. 

Using the result of the previous example 



and hence Xx = —4, x 2 = 5, and x 3 = 6. 
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8.3 DETERMINANTS 

The solution of the pair of equations 


<*U*1 + *12*2 + h = 01 

<*21-*1 + <*22*2 + b 2 = Of 


X 1 — (<*12^2 <*22^l)/(<*ll<*22 — <*12<*2l)j 

X 2 = — (<*11^2 — <*21^l)/(<*ll<*22 — <*12<*2l)J 


( 20 ) 

( 21 ) 


provided that the denominator is non-zero. The three expressions in 
brackets arise from the cross products of the pairs of elements 



Such expressions are called determinants of the second order. The 
augmented coefficient matrix of (20) is the 2x3 matrix 


(au a 12 M 

V<*21 <*22 ^2/ 


( 22 ) 


and each determinant of (21) is an evaluation associated with the three 
2x2 matrices available in (22). For example (a 12 b 2 — a 22 b x ) is the 
evaluation of the determinant associated with the matrix 


/<*12 M 
\<*22 b 2 ) 

This determinant is written 

\a 12 bA 

I <*22 ^ 2 |- 

The solution (21) may be written in determinant form 


x 1 _ —x 2 _ 1 

«i2 bi <*u bx an a 12 

<*22 b 2 a 2 i b 2 a 2 i a 22 


(23) 


Example The matrix 
of the second order 


-7 

1 


contains the three determinants 



whose values are 17, 2, and 1 respectively. 
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Likewise, if three equations in three unknowns 

<*11*1 + <*12*2 + <*13*3 + *1 = O'! 

<*21*1 + <*22*2 + <*23*3 + h = 0 J (24) 

<*31*1 + <*32*2 + <*33*3 + *3 = 0 J 

have a solution, each x t is found as the ratio of two six-termed expres¬ 
sions. The denominator is common to each x, and has the value 

<*11<*22<*33 — <*11<*23<*32 "T <*12<*23<*31 — <*12<*21<*33 + <*13<*21<*32 

- <*13<*22<*31 (25) 

The reader should verify this expression by solving equations (24) for 
x 4 by eliminating x 2 and x 3 . 

An expression of the type (25) associated, in this case, with a 3 x 3 
matrix whose elements are the coefficients of the x ( of (24) is called a 
determinant of the third order and is written 

I <*11 <*12 <*13 I 

<*21 <*22 <*23 • 

I <*31 <*32 <*33 I 

The other three third order determinants in the solution of (24) are 
associated with the further three 3x3 matrices obtained from the 
augmented coefficient matrix of (24). Each determinant contains six 
terms, three prefixed with a + sign and three with a — sign. 

Again, if four equations in four unknowns x, have a solution, each 
x ( is found as the ratio of two 24-termed expressions. The denominator 
term is common to each x, and has the value 

<*11<*22<*33<*44 <*11<*22<*34<*43 + <*11<*23<*34<*42 + • • • (26) 

to 24 terms. The expression (26) is associated with the 4 x 4 coefficient 
matrix (a u ) and is the evaluation of the fourth order determinant 
written 

<*11 <*12 <*13 

<*21 <*22 <*23 

<*31 <*32 <*33 

<*41 <*42 <*43 

The 24 terms in the evaluation of (27) (in which 12 are prefixed with 
the — sign) are each of the type a la a 2g a 3r a 4d where oc, /?, y, S are the 
numbers representing the columns from which the element is taken and 
a flyd is some arrangement, or permutation, of the natural order 1234. 
The first term with the + sign is <*ii<* 22 <*33<*44 and is called the leading 
term. The number 24 is simply the number of ways in which the natural 
order 1234 can be arranged. Six of these arrangements are 1234, 1243, 
1324, 1342, 1432, 1423. The reader should satisfy himself that there are 
4!, i.e. 24 terms in all. 






162 


MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


8.3.1 The determinant of the nth order 

An n x n matrix A or (n y ) has associated with it a determinant of the 
nth order, i.e. n rows and n columns, written 


Ml = Kl 


an 012 • • • a ln 

021 a 22 ■ ■ ■ a 2n 


(28) 


The determinant (28), unlike the matrix A which remains a manipula¬ 
tive array of elements, has a definite value and consists of the sum of n ! 
terms each of which is of the form ±a la a 20 a 3y . . . a ni where a/3y . . .2 
is some arrangement of the natural order 123 . . . n and the sign of 
the term depends on the arrangement. 

The formal mathematical approach is to define the value of (28) by 
this sum and to derive the properties of determinants using the term by 
term evaluation. This approach will not be dealt with here but instead 
the evaluation and properties will be deduced from (25) for third order 
determinants, verified as necessary for fourth order determinants and 
then assumed to be true for the general determinant of order n. 


8.3.2 Minors and cofactors 
Consider the third order determinant 
I «n <Ji2 a 13 I 

Ml — °21 °22 a 23 

| 031 032 033 | 


— 011022033 — 011023032 + 012023 fl 31 

— 012021033 + 013021032 — a l3 a 22 a 3l ■ (29) 

Each element a tJ of the determinantal array (29) has associated with it a 
determinant of the second order obtained by striking out the row and 
column containing the element a t/ . This determinant is called the minor 
of a tl and written For example, the minor of a 12 is the second 
order determinant 


M 12 


021 023 

^31 ^33 


having the value (a 21 a 33 — a 31 a 23 ). A re-arrangement of the right side 
of (29) shows that 


Ml — 01l(022033 032°23) — 012(021033 — 031023) 

+ 013 ( 021«32 — 031022 ) 

= 0 nA/n — Oi 2 M 12 + a 13 M 13 . 


( 30 ) 
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The re-arrangement of (29) can be performed in five other ways using 
the elements of the two other rows and three columns. For example, 
two further ways are given by 

\ A \ =: — a 12^12 + 022-^22 — 032^32 

= U31 A/3^ — U32A/32 "F ^33^33- ( 31 ) 


It is more convenient to introduce the cofactor of the element a tj . This 
is usually written A tj and is the signed minor of a tj so that A tj = ±M f} . 
The sign for the leading element is -F and the sign of any other cofactor 
is taken from the array of signs 



+ - + 


For example, 

A 23 = —M 23 = -1 fll1 ° 12 I and A 31 = +M 31 . 

I O31 <*32 I 

The determinant (29) may be written 

0u-4u + 012^12 4- 0 i 3 ^i 3 , evaluating along the first row, 
or a xl A lx + 021^21 + 031 ^ 31 , evaluating along the first column, 
or a X2 A X 2 + 022^22 + 032 ^ 32 , evaluating along the second column, 

or in three other ways. The six possible ways of evaluating the third 
order determinants are 


\A\ = a a A a + a i2 Ai 2 + a i3 A i3 , evaluating along the ith row ) 

= a u A lf + a 2i A 2i -F 03 ^ 3 ^ evaluating along they'th column/' ^ 

Extending this process to the fourth order determinant and using the 
elements of the first row, say, (27) can be written 

a n A n + a 12 A 12 + 013^13 + 014^14 (34) 


in which each of the cofactors A 1} - is a third order determinant that 
can be evaluated using (33). Note that (34) becomes 


022 023 

011 032 fl 33 

042 043 

or a xx a 2 2 


033 

043 


024 

034 

044 

034 

044 


+ . . . to 4 terms, 

I 032 034 I , I 032 033 1/ 
“ 023 + 024 / 

I 042 044 I I 042 043 |J 


+ . . . to 12 terms. 


or UijU 22033044 — 011022034043 -F 0 H 023034042 + • . . to 24 terms. 
When written out in full the last expression may be shown to be 
identical with the expression (26). 
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The evaluation of the nth order deter min ant is now defined by the 
extension of (33). Thus, 

Ml = + a i 2 A t2 + ... + a in A in (ith row) ) 

= fliMn + o 2 f A 2f + . . . + a nj A n1 (y'th column)/ ^ 

in which A tJ is the cofactor of a tl and is that determinant of (n — l)th 
order obtained by striking out the elements of the ith row and the y'th 

column of (28) prefixed with the sign taken from the array 

I + - + ■ • • (- 1)"- 1 I 

+ - + . . . 


I ( — l) n 1 . . . — + 

Repeated application of (35) will reduce any nth order determinant to 
n !/6 determinants of the third order or n 1/2 determinants of the second 
order prior to evaluation. In practice determinants of large order may 
be simplified by using the properties of the next section, evaluated by 
numerical methods, or may be avoided altogether. 

2 111 
0 2 10 

3 0 12’ 

1-12 1 

Solution Using the elements of the second row, the evaluation is 

i 

->| 2 1 1 |«- 2 1 

-0 + 2 3 1 2—1 3 ( 

I 1 2 1 1-1 


Example (i) Evaluate 


+ 0. 


Further evaluation along the row and column indicated leads to 


2 2 


1 2 I 

2 1 


- 1 


13 2 I 
1 1 


+ 1 


3 1 1 
1 2 j 


- 1-1 


= 2{2(—3) - 
= 2{—2} - i 
= -4. 


1 + 5} 


| 3 2 | , 

|l l| + ( 

-{-!(+!)+ 0+ 1} 


A useful method of evaluation for a third order determinant only is 
given by the Rule of Sarrus. Repeat the first two columns of the deter¬ 
minant as shown below and then form the two groups of triple products 
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one prefixed with the + sign and one with the — sign. It will be seen 
that the two groups consist only of the terms of (25) 



Example (ii) Evaluate 
Solution Write 



2 

7 

0 


4 3 2 14 3 

1-1 7 1—1 

-5 8 0 | -5 8 

and the evaluation is 


+ {(4)(-l)(0) + (3)(7)(—5) + (2)(1)(8)} 

- {(—5)(—1)(2) + (8)(7)(4) + (0)(1)(3)} 
= {0 - 105 + 16} - {10 + 224 + 0} 

= -89 - 234 
= -323. 


8.3.3 Some properties of determinants 

The results to be stated in this section are general results and may be 
applied to determinants of any order but illustrations are usually given 
by referring to the third order determinant 


with cofactors A, 


Ml 


<*12 Ol3 

a 2 i a 22 «23 
a 31 a 32 a 33 


Property 1 

If every element of a row of a determinant is zero then the value of the 
determinant is zero. 

This follows immediately from (35) by evaluating along the row of 
zero elements. 


Property 2 

The determinant of any square matrix is equal to the determinant of 
the transposed matrix, i.e. to any property concerning the rows of a 
determinant there is an equivalent property concerning the columns. 
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This result is clearly true for a second order determinant because 
<*n <*12 

<*21 <*22 

Consider 


<*11 <*21 
| <*12 <*22 | 


= 011^22 — <* 21 <* 12 - 


011 <*21 <*31 

M'| = <*12 <*22 <*32 

I <*13 <*23 <*33 

and evaluate using the elements of the first column, i.e. 


\A'\ — an 


<*22 <*32 

<*23 <*33 


— <*12 


<*21 

<*23 


= <*lMll + <*13^12 + <*13^13 


<*31 

<*33 


+ <*13 


<*21 

<*22 


<*31 I 
<*32 I 


= Ml- 


Property 3 

If each element of a row (column) of a determinant is multiplied by a 
scalar k then the determinant is multiplied by k. 

This follows immediately by evaluating the determinant along the 
row (column) containing the factor k. 

Example (i) 


<*11 

<*12 

<*13 


<*11 

ka 12 

<*13 

£<*21 

£<*22 

£<*23 

= £MI = 

<*21 

£<*22 

<*23 

<*31 

<*32 

<*33 


<*31 

£<*32 

<*33 


Property 4 

If any two rows (columns) of a determinant are interchanged, then the 
sign of the determinant is changed. 

Consider 

I <*31 <*32 <*33 

Ml = <*21 <*22 <*23 

| <*11 <*12 <*13 

obtained from \A\ by interchanging the first and third rows, and 
evaluate along the third row. 


1*1 = <*u 


<*32 <*33 

<*22 <*23 


<*31 <*33 

<*12 

| <*21 <*23 


+ <*13 


»31 

<*21 


<*32 

<*22 


— <*ll( — -^ll) ~ <*12(^12) + <*13( — A i3 ) 


= -Ml- 


Corollary to Property 4 

If any two rows (columns) of a determinant are identical then the 
determinant is zero because interchange of the rows (columns) would 
lead to a change of sign by this property and yet must at the same time 
leave the determinant value unchanged. 



Example (ii) 
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3 2 1 3 2 1 

1 32=2132=0. 
6 4 2 3 2 1 


Property 5 

The sum of the products of the elements of any row (column) with the 
cofactors of any other row (column) is zero. 


Consider |2?| = a u A 21 + a 12 A 22 + a 13 A 23 in which the elements of 
the first row of \A\ are multiplied by the cofactors of the second row. 


Then |5| = —a n 


All fll2 «13 

flu a 12 a 13 

a 31 a 32 a 33 


by the corollary to Property 4. 

This property may be generalized and, together with the cofactor 
evaluation (35) for the zth row, may be expressed in the single statement 


t a u Am 

y =i 


10 if / # k 
\\A\ if i = k. 


Employing the summation convention for a repeated index this may be 
written 


a n A kj — d (fc |^(| 


(36) 


where d ik is called the Kronecker delta and has the property that 


10 if i^k 
(l if i = k. 


(37) 


Using the elements of the y'th column in (35) a result similar to (36) 
may be stated 

a v A ik = d Jk \A\ (38) 


Property 6 

A determinant is unchanged in value if to the elements of a row 
(column) are added a scalar multiple of the elements of any other row 
(column). 

Consider 

I <*11 + ka 21 a l2 + ka 22 a 13 + ka 23 I 

l 5 l = 021 a 22 a 23 

I fl 3i O32 O33 I 
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then 


\B\ = (flu + ka 2 i)A 11 + (a 12 + ka 22 )A 12 + (a 13 + ka 23 )A 13 

= (fluAn + a 12 A 12 + a 13 A 13 ) + k(a 21 An + # 22^12 + 023 -^ 13 ) 

= Ml 


the last line following from (36). 

This property may be applied repeatedly to a determinant in order 
to reduce it to a simpler form before or during evaluation. 

Example (iii) Evaluate 

1 5 —3 R x 

2 7 0 R 2 . 

3 9 3 R 3 


Solution Perform the operations indicated 


1 

1 

1 


5 -3 
2 +3 
2 3 



0 by the corollary to Property 4. 


Example (iv) Evaluate 

I 1 1 1 I 

a b c . 
I a 2 b* c 2 I 

Ci c 3 c 3 

Solution Perform the operations indicated 


1 

0 

0 

->| 1 0 0 

a 

b-a 

c — a 

= (b — a)(c — a) a 1 1 

a 2 

b 2 - a 2 

c 2 — a 2 \ 

1 a 2 b + a c + a 

Ci 

C 2 - C, 

C 3 — C x 



= (b~ a)(c - a)(c - b). 


8.4 THE ADJUGATE OR ADJOINT MATRIX 

Before the inverse matrix introduced in section 8.2.9 can be finally 
defined it is necessary to examine a square matrix called the adjugate 
or adjoint matrix of A = a u written adj A and defined by 
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where A u is the cofactor of the element a it of the z'th row and yth 
column of the determinant \A\. Note that adj A may be obtained from 
A', the transpose of A, by replacing the elements of A' by their cofactors 
in \A'\. 


Example (i) If 
A = 


/4 2 1\ /4 3 0\ 

3 0 1] then A'= 1 2 0 l] 

\0 12/ \l 1 2/ 

/-! -3 2\ 

adj ,4 = -6 8 -1 . 

\ 3 -4 -6/ 


Let B be the matrix product of A and adj A then 


l a 11 a 12 a 13 • ■ 

I #21 a 22 023 • • 


i A 2 i A 31 


(40) 


and from the definition of matrix multiplication 

b ik = a lt A kj summed over j 

= <5, .Ml 

in the notation of (36), section 8.3. The property of d lk shows that B is 
a scalar matrix and 


A adj A = I 


l\A\ 0 0 

Ml 0 

o o 


Ml/ 


(41) 

Using (38) of section 8.3, an exactly similar result follows for (adj A)A 
so that 


Example (ii) If 


= \A\I. 

8.3, an exr 

A (adj A) = \A\I = (adj A)A. 

■(? 4). 


(42) 
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and then 

2)(_:d-g °h- 

4 ) = (» 9 -“ 

Example (iii) If 

,2 3 1\ n 0 1\ 

A = 11 2 01 and B= (0 2 1 
\0 12/ \l 1 1/ 

show that adj ( AB ) = (adj B)(adj A). 

Solution 

/3 7 6\ / 0 3 —3\ 

AB=(l 4 3 j and adj AB = I 3 -3 -3) 

\2 4 3/ \—4 2 5/ 

/ 1 1 — 2\ / 4 —5 —2\ / 0 3 — 3\ 

(adj B)(adj ^) = 1 0 — 11 (—2 4 11= 3-3-3 . 

\—2 -1 2/\ 1 -2 1/ \—4 2 5/ 


8.5 THE RECIPROCAL OR INVERSE MATRIX 

In section 8.2.9 the inverse matrix of a square matrix A was written in 

symbolic form as A' 1 such that 

AA- 1 = A' 1 A = I. (43) 


Comparison with (42) shows that (adj A)/\A\ is identical with A 1 , i.e. 


A ~ 1 = \J\* d i A (44) 

provided that |A| #0. 

If \A\ =0, the matrix is called singular and the inverse A -1 does not 
exist. 


Example (i) Given 



find A- 1 . 


Solution Here \A\ = 2(0) + 1(1) = 1 so that A is non-singular and 



12 1 1\ 

Example (ii) Given A = JO 11 0 find A -1 . 

\6 4 3 


Solution Here |A| = 0, A is singular, and A -1 does not exist. 
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8.6 SOLUTION OF REGULAR LINEAR ALGEBRAIC 
EQUATIONS 

Consider the system of n linear equations in n unknowns x, 

Ax = B (45) 

where A is the n x n matrix (a u ), x is the column matrix (x,) and B 
is the column matrix (b t ). If A is non-singular the system (45) is called 
regular. With A non-singular, A -1 exists and equation (45) may be 
written 


or x = A~ 1 B 

The equation (46) is then the formal solution of (45). 

Example (i) To solve 2x x + x 2 — x 3 = 4 

x x — 2x 2 + x 3 — —10 
- 3x x -2x 3 = 9 


(46) 


write 


and 


/ 2 1 -‘\ h\ i 4 \ 

= I 1—2 11, x=jx 2 J and B — I —101. 

\-3 0 -2/ W \ 9/ 

Since \A\ = (—3)(—1) + (-2)(-5) = 13, then 

”(:! 3 

M:i m- it3 


Hence, x x = — 1, x 2 = 3 and x 3 = —3. 

The solution (46) may be expressed in terms of determinants and 
cofactors. As an illustration consider the case n = 3, then 


i l Mn a 21 AtA/bA 

x = W \(*d)A)B- Wl \A 12 A 22 Ajj^bj 

lb x A xx + b 2 A 21 + b 3 A 31 \ 
= Mu + Ma + Ma 
11 \*iA 13 + b 2 A 23 + Mas/ 
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where A ti is the cofactor of the element a tj in A. Hence, 

JCi x 2 x 3 1 

a; = a; = a 3 = r 

° r w (4,) 

where A j = b l A ij (summed over /). 

In equation (47), \A\ is the determinant of the matrix A = (a tj ) and 
A] is the determinant of that matrix obtained from A on replacing the 
yth column of A by the column (b,). The solution of equation (45) 
expressed in the form (47) is called Cramer's rule. The result (47) may 
be extended immediately for the case n> 3. 

Example (ii) Solving example (i) by this rule leads to 


Xi_ _*2_*3_1 


4 1 -1 

— 

2 4 1 

~ ’ 

2 1 4 


2 1 -1 

-10 -2 1 


1 -10 1 


1 -2 -10 


1 -2 1 

9 0-2 


|-3 9 -2 


-3 0 9 


-3 0 -2 


It is left to the reader to evaluate each determinant and verify the 
solution. 


8.6.1 Solution by the elimination method 

A general method of dealing with the system of equations (45) is to 
eliminate unknowns x ( until one equation contains only one unknown, 
solve for this unknown, and then work back through successive equa¬ 
tions to find the other unknowns. A systematic method of procedure 
was suggested by Gauss. This method is sometimes laborious but being 
systematic, and of a routine nature, it is suitable for a computer 
program. 

Consider as an example the system of equations 


2*! — x 2 + 3x 3 = 

10 ] 


x 1 + 3x 2 — 4x 3 = — 

11 

(48) 

4x x + x 2 — 2x 3 = 

3) 


12 -1 3\/xA 

( 10 \ 


(1 3 — 4)(x 2 = 

i 11 

(49) 

\4 1 -2/W ’ 

l 3 / 


A x = 

B 



The matrices of (49) may be written in augmented matrix form (A, B) 


[2 -1 3 

10\ 

(l 3-4 

-n 

\4 1 -2 

3 


( 50 ) 
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arid because of the equality signs in equations (48) certain row manipu¬ 
lations may be performed on (50) without affecting the final solution. 
The three important row transformations are: 

I Interchange of rows, 

II Multiplication of a row by any non-zero constant, 

III Addition of rows. 

Apply these row transformations to (50) to make the matrix A into an 
upper triangular matrix, i.e. one with zero elements below the leading 
diagonal. The successive steps are indicated below. 




3-4 1- 

-11\ 

Apply I, 

* 

2 

-1 3 

io Lr 2 


^4 

1 “ 2 1 

3/ *3 



/I 3 - 

4 | -11\ Rx 

and then apply II and III 

(0 -7 11 32 1 R' 2 



\0 -11 14 | 47/ R' 3 

where 


K = R 2 ~ 2* x ; R' 3 = R 3 - 4 Ri 


/I 

3 -4 

-11\ 7?x 

Apply II and III again 

P 

-7 11 

32*2 


\o 

0 -23 

-23/ r; 

where 


R 3 = 1R' 3 • 

- 11*£ 

and a final application 

of II leads to 



/I 

3 -4 

-n\ 


P 

7 -11 

-32 . 


\o 

0 I 

1/ 


The solution may be written down working from the last row, i.e. 
x 3 = 1 , 7x 2 — 11*3 = —32, and Xx + 3x 2 — 4x 3 = —11 or x 3 = 1, 
*2 = — 3 and = 2. In a practical problem the coefficients of a 
system of algebraic equations are usually decimals and it is convenient 
to use II to make the coefficients of the variable being eliminated equal 
to unity. The numerical details of this, together with other methods of 
solution, is given in Chapter 13. 


8.6.2 Alternative method of determination of the inverse matrix A ~ 1 
If A is a non-singular matrix then the equation 

Ax = IB (51) 

has the formal solution 


Ix = A~ 1 B (52) 

which suggests that if A is transformed into / by the row transforms 
I, II, and III, then / will be transformed into A -1 by the same processes. 




174 


MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


The method is illustrated in the following example where ~ is a 
symbol used to indicate that successive matrices are obtained by row 
transforms. 


Example Find the inverse of 


Solution Write 


(A, /) ■ 


■£ 


£! 

| 1 0 0\R 1 

o i o r 2 

0 0 1/ r 3 


I 


/I 2 2 1 0 0\ R[ 

- o 1 2-1 1 o \ r ' 2 

\0 0-1 0 —1 1/^3 


where R' = R a - R 2 , K = 


1 0 
0 1 
fi 0 
R[ - 2R' 2 - 2 R'; R" 2 


R 1 ;R‘ : 
0 — 2 ' 
1 


1/ K 


A- 1 ). 


-- R' 2 + 2R' 3 ; R" = 


A' 1 


■(-! 3 - 


These steps are not the only ones by which (A, I ) can be transformed 
into (/, A~ l ) and the reader is invited to attempt the transformation 
using different steps and also to verify that the result obtained is the 
same as adj Aj\A\. 


8.7 FURTHER DISCUSSION OF THE SOLUTION OF n 
LINEAR EQUATIONS IN n UNKNOWNS 
The discussion of the system of n equations in n unknowns 

Ax = B (53) 

is given here without too much detail. The reader who requires a more 
formal treatment is advised to read this section and then to consult a 
text devoted exclusively to matrix theory. 


8.7.1 Regular solution, A non-singular. 

If the n x n matrix is non-singular, then \A\ =/ 0 and it is evident that 
Cramer’s rule, 

x i — Aj/i.d|, j = 1, 2, 3,. . ., « 

leads to n distinct values of x h i.e. to a unique solution. In this case 
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the n linear equations are said to be linearly independent. Some of the 
A y may be zero to give zeros in the corresponding x } , but the equations 
(53) cannot have a solution with all x } zero and A non-singular unless 
each Ay is zero, and this implies that B is zero. The equation (53) then 
reduces to 

Ax = 0 (54) 

This equation is called a homogeneous equation (the equation Ax — B 
being called non-homogeneous) and has a unique but zero solution if 
and only if A is non-singular. This solution of equation (54) is called 
the trivial solution. 

8.7.2 Singular solution, A singular 

Suppose the matrix A is singular. The \A\ = 0 and the form given by 
Cramer shows that if the Ay ^ 0 the x } cannot be found, i.e. no solution 
exists and the system of equations is said to be inconsistent. 

Example (i) The system 

*1 + *2 = 3 
2x a + 2x 2 = 7 

is obviously inconsistent. Here \A\ = 0, Ay = — 1, and A a = 1. 
Example (ii) The system of equations 

3*! + 2x 2 + x 3 = 10 
2x 1 + 3x 2 — *3 = 5 
4*i + x 2 + 3x 3 = 8 

has no obvious inconsistency but \A\ = 0, A a = 35, and A 2 , A 3 are 
also non-zero. 

A closer inspection of the two examples shows that in the left side 
of each system one of the expressions is dependent on the others, for 
example 

(4x x + x 2 + 3x 3 ) = 2(3x x + 2x 2 + x 3 ) — (2x 1 + 3x 2 — x 3 ), 
but that this is not the case for the right side, i.e. 8 =£ 2(10) — (5). 

Suppose now that all the Ay are zero, then Cramer’s solution 
x, = Ay|A| is of the form 0/0 which is indeterminate. 

Example (iii) The system 

3xy + 2x 2 + x 3 = 10 
2x x + 3x 2 — x 3 = 5 
4x x + x 2 + 3x 3 = 15 

has |v4| = 0 and A, = A 2 = A 3 = 0 and there is no unique solution. 
In this example the last equation is a linear combination of the other 
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two and the system reduces to one of two equations and three unknowns 
3x a + 2x 2 + x 3 = 10 
2x x + 3x 2 — x 3 = 5. 

The original system of three equations is said to be linearly dependent. 
One of the unknowns can be disposed of arbitrarily, say x 3 = «, and 
the others expressed in terms of a, i.e. 

= 4 — a, x 2 = a — 1, x 3 = a. 

Note that there is a solution to the system of equations although 
indeterminate to the extent that a is arbitrary. Any system that possesses 
a solution regular or indeterminate is called consistent. 

To sum up, a system of equations is either inconsistent (|v41 =0, 
0) or consistent and only the latter possesses a solution. A con¬ 
sistent system may have a unique solution (i.e. one fixed set of values 
for Xj) in which case the solution is called regular or it may have an 
indeterminate solution (i.e. an infinite number of different sets of 
values Xj). An illustration in which one variable may be assigned 
arbitrary values is given by considering the single equation 2x 1 + 3x a 
= 6 with two unknowns. Geometrically this is the equation of a straight 
line in the plane Ox x x 2 and the solution Xy = 3 — fa, x 2 = a gives 
the coordinates of an arbitrary point on the line. Similarly, the pair of 
equations above 

3*! + 2x 2 + x 3 = 10 
2xj + 3x 2 — x 3 = 5 

may be geometrically interpreted as the intersection of two planes in 
three dimensions (see section 9.7). The solution 

Xi = 4 — a, x 2 = a — 1, x 3 = a 

gives the coordinates of the general point on the common line of inter¬ 
section of the planes. Different values of a simply give different points 
on this line, e.g. (4, —1,0), and (10, —7, —6) are two such points. 

It is necessary to consider the general case. Without proof, it can be 
stated that the system is consistent if and only if r of the equations 
are linearly independent where r is any non-zero positive integer such 
that r < n and then (n — r ) of the unknowns can be disposed of 
arbitrarily. In order to investigate a system of equations the elimination 
method of section 8.6.1 may be used. Any inconsistency in the system 
will appear in the equivalent form of the augmented matrix (A, B). 
The following examples should illustrate the process. 

Example (iv) Discuss the system of equations 
2x t + x 2 + 5 x 3 = 1 
Xi — 3x 2 + 6x 3 = 2 
3x x + 5 x 2 + 4x 3 = 0. 
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Solution Write 




6 

-7 

0 


7? 2 - 27? x ; R 3 - 37? x 7?' 3 - 27?' 2 


The last row of zeros shows that the system is consistent but that one 
of the equations is linearly dependent on the others. One unknown may 
be arbitrarily assigned, let x 3 = a then lx 2 = 7* — 3 and lx x = 
5 — 21oc. Note that here n = 3 and r = 2. 


Example (v) Discuss the system 



Solution In this case 

/I "3 6 1 2V 

(zf,.5)~(0 7 -7 -3) 

\0 0 0 | 1 / 

and the last row shows that the system is inconsistent. Assuming that 
X\ and x 2 are non-zero the equation corresponding to the last row is 
Ox 3 = 1 which is impossible for finite x 3 . The system has no solution. 

Finally when B = 0 the system is homogeneous. Ax = 0 and the 
system is always consistent. If \A\ ^ 0 there is only one solution 
Xj — 0 (see section 8.7.1) called the trivial solution but if \A\ — 0 then 
(n — r) of the variables may be assigned arbitrary values where r is an 
integer such that 0 < r < n, and the remaining r variables obtained in 
terms of these assigned values. Thus the condition that the equation 
Ax — 0 has a solution other than the trivial one is that \A\ =0. 


Example (vi) Solve 

3*! + X 2 + 2*3 — X 4 = 0 

x x — x 2 — x 3 =0 

4xx + x 3 — x 4 = 0 
9x x — x 2 + x a — 2xi = 0. 
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Solution The matrix is transformed as follows 



( 1 -1 -1 0 \ /I -1 -1 0 \ 

0 4 5 "M- 0 4 5 - 1 

0 4 5 -1 0 0 0 0 

0 8 10 - 2 / \0 0 0 0 / 

and the last two rows show that the system has two equations that are 
linearly dependent. Two of the variables may be assigned arbitrary 
values, let x 3 = a and x 4 = /? then 4x 2 = /? — 5a and 4x 4 = f} — a. 

8.8 THE SOLUTION OF A GENERAL SYSTEM OF m 
LINEAR EQUATIONS IN n UNKNOWNS 
When m = n the system reduces to that already discussed in section 8.7. 
Consider now the two cases m <n and m> n. 

8.8.1 Case m < n 

Given the system 

Ax = B (55) 

A is an m x n rectangular matrix, x is an n x 1 column matrix and 
B is an m x 1 column matrix and there are more unknowns x, than 
there are equations. The system (55) is consistent if and only if r of the 
equations are linearly independent where r is a non-zero positive integer 
such that r < m. In the solution, (n — r ) of the unknowns may be 
assigned arbitrary values and the remaining r variables found in terms 
of these assigned variables. 

Example (i) Examine the system 

3*! + x 2 — x 3 + x 4 = 4 
Xi -f- 2 X 3 + 2x 4 = 5 
3x 2 + x 3 — x 4 = 3 

in which m = 3 and n = 4. 

Solution Proceeding by the elimination method 
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The equivalent form of (A, B ) shows that the system has no inconsis¬ 
tency, the equations are linearly independent, and r = m = 3. One 
(n — r = 1) of the variables may be arbitrarily assigned. Let x 4 = a 
then llx 3 = 18 - 7a, llx 2 = 5 + 6a and llx x = 19 - 8a. 


Example (ii) Examine the system 

*i + * 2 — *3 — Xi = 1 

2x x — x 2 + Xi = 2 
3x x — x 3 =3 
in which m = 3 and n = 4. 

Solution 




1 -1 -11 

1\ 

/I 1 -1 -1 

1\ 

-1 0 1 

2 I~ 

0-3 2 3 

°j 

0-1 0 

3/ ' 

\0 —3 2 3 
/3 0-1 0 

0/ 

3 \ 


-1 

0-3 2 3 

\0 0 0 0 



R s - R 2 ; 3 Rx + R 2 

which shows that the system is consistent but that one of the equations 
is linearly dependent on the other two, i.e. r = 2. Then two (n — r = 2) 
of the unknowns may be assigned arbitrary values. Let x 4 = a, x 3 = /?, 
then 3x a = 2/? + 3a and 3x x = 3 + (1. 


Example (iii) Examine the system 

x x — 2 x 2 — x 3 — x 4 = —4 
3x x + x 2 + x 3 — 2 x 4 = 11 

x x + 12 x 2 + 6x 3 + x 4 = 42 
in which m = 3 and n = 4. 

Solution 

ll -2 -1 -1 I —4\ /I -2 -1 -1 

(A,B)= 3 1 1 —2 11) —10 7 4 1 

\l 12 6 1 I 42/ \0 14 7 2 

/I -2 -1 -1 

~(0 7 4 1 

\0 0-1 0 

and the system is consistent, r = m = 3 and one (n — r = 1) of the 
variables may be arbitrarily assigned. The last equation formed from 
the equivalent matrix shows that x 3 = 0, let x 4 = a, then the solution 
is x 4 = a; x 3 = 0; 7x 2 = 23 — a and 7x x = 18 + 5a. 
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Example (iv) Examine the system 

Xi - 2 x 2 — x 3 - * 4 = -4 

3*4 + X 2 + *3-2*4= 11 

*4 + 12*a + 7* 3 + * 4 = 31 
in which m — 3 and n = 4. 


Solution 



sistent. 

8.8.2 Case m > n 

In this case the system (55) has more equations than unknowns. The 
system is either inconsistent in which case there is no solution, or it is 
consistent and the equations are linearly dependent. 

If the system is consistent then, as in section 8.8.1, r of the equations 
are linearly independent and (n — r) of the unknowns may be assigned 
arbitrary values. The non-zero integer r is such that r < n. 

Example (i) Examine the system 

2*4 +* 2=7 

*i + *2 = 5 
4*i + *a=ll 
5*i -*2=7 
3*i-2* 2 = 0 
in which m = 5 and n — 2. 

Solution 
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which shows that the system is consistent and that three equations are 
linearly dependent on the other two, i.e. r — 2, n — r = 0. Finally, 
x 2 = 3 and x x — 2. 


Example (ii) Examine the system 

2x x + x 2 = 5 
3x! + 2 x 2 = 3 
Xi+ x 2 = 2 


in which m = 3 and n = 2. 
Solution 


I 2 1 5 \ I 1 1 2 \ 

[,£)= 3 2 3 ~ 0 -1 1 

\l 1 2/ \0 0 -4/ 


The last row shows that the system is inconsistent. 


Example (iii) Examine the system 

3*! — x 2 + x 3 = 1 

+ 2x 2 — x 3 = 2 

lx 1 — lx 2 + 5x 3 = —1 

12 Xl + 3x 2 — 9 

7*i + * 3 = 4 

in which m = 5 and n — 3. 


Solution 


(A, B) 



At this stage it may be seen that the system is consistent, that three of 
the equations are redundant, i.e. r = 2, and that one (n — r = 1) of 
the unknowns can be given an arbitrary value. Let x 3 = a, then 
7*2 = 4a + 5 and 7*j = 4 — a. 

Finally, the condition that a system of (n + 1) equations with n 
unknowns is consistent may be expressed as a simple determinant. 
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Consider, for simplicity, the case of 3 equations with 2 unknowns, i.e. 
0n*i + a 12 x 2 = b 1 
«2i*i + 022*2 = b 2 
031*1 + a 3 2*2 = b 3 . 

Replace b, by tb { and the system may then be considered as a homo¬ 
geneous one of three equations with three unknowns, x u x 2 , and t. 
From section 8.7.2 the homogeneous system has a non-trivial solution 
if and only if the determinant of the coefficients is zero, i.e. 

I 0n 012 b x 1 

02i 022 b 2 = 0. (56) 

I 0 3 1 032 b 3 I 

Since t = 1, this must be the necessary condition for the original three 
equations in two unknowns to have a unique solution. The condition 
(56) can be immediately extended to the general case of n unknowns. 

8.8.3 Summary 

A system of linear equations Ax = B in n unknowns may be tested for 
consistency by carrying out a process of transformation of the matrix 
(A, B). The method involves an elimination of successive unknowns 
from the equations of the system by an application of row manipula¬ 
tions to the matrix (A, B). The process concludes when the matrix 
(A, B) has been replaced by the equivalent matrix (C, K) where AT is a 
column matrix and C is an upper triangular matrix, i.e. one whose 
elements below the leading diagonal are all zero. Any inconsistency in 
the system will be shown by a row of (C, K ) having all its elements zero 
except the element of the column K. If the system is consistent then the 
number of rows of (C, K) whose elements are not all zero is the number 
of equations of the system that are independent. The difference between 
the number of unknowns and the number of independent equations 
determines the number of unknowns that may be assigned arbitrary 
values. 

EXERCISE 8 

1 A = («(,), B = (b jk ), and C = (c kl ) are matrices oi order 7x3, 
3x4, and 4x5 respectively. The product ABC is given by 
(a tj b JK c kl ). Write out in full the element of the 4th row and the 
2nd column of ABC. 

2 Given X - ^1 4^, * = (j ~J). and C = (J i) : 

(i) evaluate ( AB)C and A(BC), 

(ii) verify that A(B + C) = AB + AC, 

(iii) find the matrix M such that B 2 + 2C 3 + 2 M = 0. 
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3 Given matrices A = (a w ) and B = (b Jk ) of orders m x n and 
rt x p respectively, write down: 

(i) the element of the ith row and the kth column of AB, 

(ii) the element of the kth row and the ith column of (AB)', 

(iii) the elements of the fcth row of B', 

(iv) the elements of the ith column of A'. 


Show that (AB)' = B'A'. 



show that (i) A 2 = I 

(ii )PQ=QP 

where P = AD X A, Q = AD 2 A, and D u D 2 are arbitrary diagonal 
matrices. 

6 Write down the cofactors of the elements/and k in the determinant 

la b cl 

\d e f\ 

I g h k \. 

12 3-11 

7 Evaluate 1 0 2 

I 4 2 1 I 

(i) using cofactors of the first row, 

(ii) using cofactors of the second column, 

(iii) by the rule of Sarrus. 

I 3 1 “ 2 I 

8 Given that 8 -5 7 = -35, 

I 4 0 1 I 

state, without expansion, the value of the following determinants 


1 3 8 

4 1 ! 

3 

-2 

-2 1 

1 3 1 -2 1 

(i) 1 -5 

l ( “l 

8 

7 

10 (iii) 

4-5 6 

1 -2 7 


4 

1 

o 1 

1 4 0 1 

(iv) 1 11 1 

-2 1 (v) 

1 3 

3 

10 1 


-20 -5 

7 

8 

-15 

~ 35 


| 0 0 

1 1 

1 4 

0 

—5 1 
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9 Evaluate 


1 -1 1 

0 2 3 

2 1 3 

4 0-1 


(i) by first forming the equivalent determinant whose elements of 
the second row are (1, 0, 0, 0), 

(ii) by first forming the equivalent determinant whose elements of 
the third column are (—1, 0, 0, 0). 

10 Find the inverse of the matrix 


/4 3 1 

A = [ 2 0-2 

\l -1 1 

by two different methods. 

11 Given the system of equations 

(i) 2x 1 + x 2 + x 3 = 6 (ii) 2x 1 + x 2 + x 3 = 6 
*i + 2x a + x 3 = 10 + 2x 2 + x 3 — 10 

IOjcj. - x 2 + 3x a = 2 7 Xl + 5;c 2 + 4x 3 = 7. 

Show that one is inconsistent and find the solution of the other. 


12 Solve, if possible, the systems 
(i) Xjl — 3x 2 + 4x 3 — x 4 = 3 

*i — x 3 + 2 x 4 = 1 

Xi — 8 x 2 + 1IX3 — 7x 4 = 7 


x 2 ■ 


3x 3 


(iii) 2Xi + 3x 2 = 7 

4xj + 5x 2 = 1 

2xj — x 2 = —19 

2xi + 4x 2 = 5 


(ii) 3xi - 2x 2 + x 3 = 
2x x + x 2 + 4x 3 = 2 
x x — 3x 2 + 2x 3 = 

= 0 4Xi + x 3 = 

(iv) Xi + 2x a + x 3 - x 4 = -2 
3*i — x 2 + 2x 3 = 2 

2xj + 5x 2 + x 3 + x 4 = 4 


13 Find the condition that ax 2 + bx + c = 
have a common root. 


0 and px 2 + qx + r = 0 


[Hint: Form 4 equations in 3 unknowns x 3 , x 2 , x and use the con¬ 
sistency condition.] 



9 

Vectors 


9.1 SCALARS 

Certain physical quantities are completely described by a magnitude, 
or a measurement in a given system of units, for example, a volume, a 
mass, a temperature, or a speed. Such quantities are called scalars and 
may be manipulated using the basic laws of algebra, which for scalars 
a, /?, and n are: 

The law of commutation 

(i) a + = p + a 

(ii) a/S = /?* 

The law of association 

(i) (a + /?) + // = a + (/S + jw) 

(ii) (a/?)/* = a(#u) = <xfti 
The law of distribution 

a(/3 + n) = a/3 + a p. 

9.2 DIRECTED MAGNITUDES 

Other quantities are not completely defined by a magnitude but require 
also a precise indication of the direction in which the measurement is 
taken. Typical examples are, a displacement, a force, a velocity, or a 
finite rotation. In the latter case, the direction is indicated by the axis 
of rotation and the magnitude by the angle of rotation. Such quantities 
may be conveniently represented on a figure by a line segment in the 
requisite direction and of a suitably scaled length to represent the 
magnitude. 

It is not necessary that these directed magnitudes have a prescribed 
point of application, for example, if a solid body moves without 
rotation in a straight line then every point of the body has the same 
velocity. Such quantities are said to be free. If a directed magnitude 
has its point of application prescribed as, for example, a given force 
acting at a point of a body then it is said to be bound. 

9.2.1 Notation 

When directed magnitudes are indicated on a diagram it is convenient 
-to use letters to label the figure and a directed magnitude may then be 

represented by a line segment OP, OP, or OP in Clarendon type. 
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The notation OP means that the direction of the measurement is 
from O to P and that the magnitude is equivalent to the length from 
0 to P. The direction in which a vector acts should be emphasized by 
adding an arrow to the line segment as in Fig. 9.1. 



Fig. 9.1 


9.2.2 Equality of directed magnitudes 

In Fig. 9.2 the two directed magnitudes OP and QR are equal, i.e. 
OP = QR, if, and only if, the lengths OP and QR are equal and they 
are in the same direction (note this means that not only are the lines 
of action parallel but that the magnitudes are directed in the same sense). 
If two directed magnitudes have the same magnitude but are in opposite 
directions, as are OP and ST in Fig. 9.2, then this is indicated by 
writing OP = —ST. In particular, OP = —PO. A null or zero directed 
magnitude is one of zero magnitude in any direction. 




9.2.3 Addition of directed magnitudes 

Suppose a man walks 1 kilometre due East from a given point and then 
walks in a direction N 30° E for 1 kilometre, his path may be represented 
by the displacement diagram Fig. 9.3 where O represents his starting 
point, Q his final position, and P his intermediate position. 

His path represented by OPQ is equivalent to <JQ, i.e. walking \/3 
kilometre in a direction N 60° E from his initial starting point. The 
equivalence of paths may be formally written 
OQ = OP + PQ 
and called the triangle of addition. 


(1) 
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It is immaterial in which order the directed magnitudes of displace¬ 
ment are taken because, assuming it to be physically possible, the man 
could have walked on a path represented in the figure first by the dis¬ 
placement OR and second by RQ. From the definition of equality, 
OR = PQ and OP = RQ and it follows that 

OQ = OP + PQ^=OR + RQ = PQ + OP= OR + OP (2) 
When the parallelogram OPQR is completed, equations (2) state that 
the sum of OP and OR is given by the diagonal OQ. This is called the 
parallelogram law of addition. Equations (2) also show that displace¬ 
ments obey the commutative law of addition. In the example given the 
various paths taken all lie on the surface of the Earth. The distances 
involved are so small compared to the radius of the Earth that the 
curvature of the latter may be neglected. The points represented by 
O, P, Q, and R then all lie on a plane. In comparison see example 2 of 
Exercise 9. 

However, all directed magnitudes do not combine according to the 
triangle or parallelogram law of addition. Consider for example the 
addition of two finite rotations as follows. A point P (see Fig. 9.4) is 
situated at the position whose Cartesian coordinates are (1, 0, 0) and 
the line OP is subjected to a rotation (RJ through an angle of tt/2 
about the axis of Ox 3 and to a rotation (R 2 ) through an angle of 7r/2 
about the axis of Ox 1 . The two rotations in the sense indicated in the 
figure can be represented by directed magnitudes OB and OA (where 
the lengths OA and OB are both it/ 2). The rotation (Aj) followed by 
the rotation ( R 2 ) makes P move, via Q, into S and is therefore equiva¬ 
lent to a single rotation about the axis Ox 2 . 

If, however, the order is reversed so that rotation (R 2 ) is followed by 
rotation (/?!) it is seen that P moves to Q. The conclusion is that in this 
example the sum OA + OB is not the same as the sum OB + OA; the 



MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


directed magnitudes of finite rotations do not obey the parallelogram 
law of addition or the commutative law of algebra. 



9.3 VECTORS 

A directed magnitude that obeys the parallelogram law of addition so 
that, in Fig. 9.3, 

OQ = OP + OR = OR + OP 

is called a vector. Examples of vectors are displacement, force, velocity, 
acceleration, and angular velocity. From the definition it follows that 
a finite rotation is not a vector. In addition to the notation of section 
9.2.1, a vector is frequently denoted by a single letter such as a or b 
printed in Clarendon type. The non-negative scalar magnitude of a 
vector a is then denoted by |a| or simply a in italic type. 

9.3.1 Equality of vectors 

Two vectors a and b are equal if, and only if, a = b and the directions 
are the same. If a = b but the directions are opposite then a = — b. 

9.3.2 Zero vector and unit vector 

A null or zero vector is of zero length and has no particular direction; 
it is frequently denoted by 0. A unit vector is one whose magnitude is 
unity. 


9.3.3 Addition and subtraction of vectors 

If more than two vectors are added together then the addition law is 
applied successively as illustrated in Fig. 9.5 until a single vector (the 
resultant) is obtained. 
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It may be easily shown, using the properties of parallelograms, that the 
order of continued addition is immaterial and that 
R = (a + b) + c = a + (b + c) 

so that vectors obey the associative law of addition. The vectors of 
Fig. 9.5 do not necessarily lie in one plane. 



Fig. 9.7 


The difference of two vectors a and b is simply the sum of the vector 
a and the vector (—b). In Fig. 9.6, a is added to (—b) using the parallelo¬ 
gram rule. 

If (a — b) is a free vector then it may be moved to any parallel position 
and in Fig. 9.7 the sum and difference of two vectors a and b are repre¬ 
sented by the diagonals of the parallelogram with sides a and b. 
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(Note that the vector a — b is in the direction from the end of b to 
the end of a.) 

Example (i) If the vectors a and b are inclined at an angle 0, then 
|a — b| = \/(a 2 + b 2 — lab cos 0). 
and ja + b| = \/(a 2 + b 2 + lab cos 6). 

Solution This follows by an immediate application of the cosine rule to 
the triangles of Fig. 9.7 where angle AOB is 0. 

Example (ii) Prove the inequalities 

l« - b| < |a| + |b| 

and |a-b|>||a|-|b||. 

Solution In the triangle OAB of Fig. 9.7 it is clear that 
BA < OA + OB 

and the first result follows immediately. 

Further, if a > b write 

OA*Z OB + BA 
or BA^OA- OB, 

i.e. |a-b| > |a| - |b|; 

if a < b write 

OB < OA + AB 
or AB ^ OB — OA, 

i.e. |a-b|>|b|-|a|. 

The second result then follows. Obviously both equalities cannot hold 
at the same time unless either a or b is the null vector. 

Example (iii) A car ferry crossing a sea loch travels 0-5 kilometre 
in 3 minutes. The ferry is observed to move at an angle of 60° to the 
current which flows uniformly at 5 kro.p.h. Find the velocity of the ferry 
relative to the water. 

Solution In Fig. 9.8(a) the velocity v of the water compounded with the 
velocity V of the ferry relative to the water must give the resultant 
velocity n in the direction AB X or AB 2 . The magnitude of the velocity u 
is 10 km.p.h. (given). 

The vector diagrams in Figs. 9.8(b) and 9.8(c) show that the required 
velocity in each case is u — v. If the ferry travels from A to B x , then 

V = V x = V(5 2 + 10 a + 2.5.10 cos 60°) 

= 5^7 km.p.h. 

and tan 6 X — 10 sin 60°/(5 + 10 cos 60°) 

= V3/2 

i.e. 6 X = 40° 54'. 
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If the ferry travels from A to B 2 then 

V = V 2 = V(5 2 + 10 2 - 2.5 . 10 cos 60°) 
= 5\/3 km.p.h. 

and tan 0 2 — 10 sin 60°/(5 — 10 cos 60°) 

i.e. 0 2 = 90°. 


Bz fi. 



(b) (c) 

Fig. 9.8 


9.3.4 Multiplication of a vector by a scalar 

If A is a scalar quantity then 2a is a vector of magnitude |Aa|, i.e. |A|a 
whose direction is that of a if A is positive and that of —a if A is negative. 
The Fig. 9.9 shows a free vector a together with three vectors 2a, —3a, 
and £a. 



Fig. 9.9 


If a is an arbitrary vector then a/|a| is a unit vector in the direction 
of a. A notation in common use is to reserve a for this vector. Thus 
a = |a|a. (3) 


= aa 


or 
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Using the properties of similar triangles and simple proportion it 
may be shown that if X and p are two scalars and a and b are two 
vectors then 

= /^ a = ^Oa) (4) 

(X + ft)a. = Aa + //a (5) 

2(a + b) = Aa + Xb (6) 

Equation (4) is the associative law and equations (5) and (6) form the 
distributive law for multiplication of a vector by a scalar. 

Example Express in vector notation that (a) P is the mid-point of OA, 

(b) CD is parallel to AB, three times as long as AB, and in the opposite 

sense to AB. _ 

Solution (a) OP = \OA, and vector OP has the same line of action as 
OA. Hence, OP = \OA. 

(b) CD = 3 AB and hence CD = —3AB, the opposite direction 
being indicated by the negative sign. 

9.3.5 Components of a vector 

Given any vector r, applying the addition rule of section 9.3.3 in reverse 
it is possible to write 

r = a x + a 2 + . . . + a n 

where the n vectors a, and the vector —r when put head to tail form 
a closed polygon as in Fig. 9.10. 



Fig. 9.10 

This polygon is, in general, a three-dimensional one. The vectors 
a 1; a 2 , . . ., a n are called the component vectors of r. Any (n — 1) of 
the vectors a ( may be chosen arbitrarily but the nth vector must close 
the polygon. 
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A case of particular interest arises when the vector r is expressed as 
the sum of three vectors parallel to three given non-zero, non-coplanar 
vectors a, b, and c. In Fig. 9.11, let OR represent the vector r and con¬ 
struct the parallelepiped with OR as diagonal and the sides OA, OB, 
and OC parallel to the given vectors a, b, and c respectively. This 




construction is unique, that is, only one such parallelepiped can be 
drawn and 

r = OR = OA + AD + DR 
= OA + OB + OC 
= Aa + fib + vc 

where A, p, and v are determinable scalars. In particular if a, b, and c 
are unit vectors then A, p, and v are the number of such units in OA, 
OB, and OC respectively. The component vectors of r are Aa, pb, and 
vc but the scalar quantities A, p, and v are frequently referred to as the 
scalar components of the vector r or, when no ambiguity arises, simply 
as the components of r. 

The case in which the vectors a, b, and c are mutually orthogonal is 
of special importance because the three directions of these vectors can 
be identified with those of the orthogonal Cartesian coordinate axes 
Ox, Oy, and Oz. Unit vectors in these directions are usually indicated 
by i, j, and k respectively. If r is a displacement vector then using 
Fig. 9.12, r may be written 

r = xi + yj + zk 

where (x, y, z ) are the coordinates of the point R relative to the point 
O. If O is the origin of the coordinates then the displacement r is called 
the position vector of the point R. 
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If V is any vector it may be written in the form 
V = Vj + V 2 \ + V 3 k 

where V u V 2 , and V 3 are the orthogonal (scalar) projections of the 
vector V on to the coordinate axes. 

An alternative notation commonly used when the Cartesian axes are 
in the form (xj, x 2 , x 3 ) is to replace (i, j, k) by (e^ e 2 , e 3 ) and then 

r = X& + x 2 e 2 + x 3 e 3 = j x^ 

and V = K iei + V 2 e 2 + V 3 e 3 = | V ( e,. 

There is only one rectangular parallelepiped that can be constructed 
with sides parallel to i, j and k and with a given diagonal OR so that 
the resolution is unique and it follows that if two vectors V and U are 
equal then their components are equal. 



Thus given that V = Vj + V 2 j + K 3 k and U = UJ + U 2 j + U 3 k are 
equal, so that V = U, then 

Vi = U u V 2 = U 2 , and V 3 = U 3 . 

Note 1 An even further abbreviated notation is to write r = xfii, the 
convention being that a repeated index means a summation taken 
over all possible values of that index. 

Note 2 A vector may be conveniently represented by its components 
in the form (x 1; x 2 , x 3 ) or 
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providing that there is no ambiguity or confusion. The vector com¬ 
ponents then form a 1 x 3 row matrix or a 3 x 1 column matrix 
and may be used in conjunction with other matrices. For example if 
x = xfr and y = yfi, and A is the 3x3 matrix a tj then 

y = Ax 

is the vector equivalent to the matrix equation 
y = Ax 

of example (ix) of section 8.2.6 where y T = y' = (y 1 y 2 y 3 ), 
x T = x' = (*! x 2 x 3 ) and 

( Oil a l2 <*13\ 
o 2 i a 22 a 23 1. 
a 3i o 32 a 33 ) 


9.3.6 Direction cosines 

Suppose a general vector V is represented by ~PQ as in Fig. 9.13 and 
that V = VJ + V 2 \ + K 3 k, then by Pythagoras’ theorem 

V = |V| = PQ = y/(PB 2 + BQ 2 ) 

= v /(PA 2 + AB 2 + BQ 2 ) 

= V(V\ +v 2 + V 2 ) (7) 



Fig. 9.13 
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The direction of PQ can be specified by the angles that PQ makes with 
the coordinate axes Ox, Oy, and Oz (or with PA, PC, and PD). These 
angles are usually designated a, /?, and y respectively and are called the 
direction angles. The angles are measured positively in the sense indi¬ 
cated in Fig. 9.13. 

It is, however, the cosines of these angles that frequently occur and 
these are designated A, ft, and v where 
A = cos a. 

ft = cos fi (8) 

v — cos y 

and called the direction cosines. From the right-angled triangles of 
Fig. 9.13 it follows that cos a = PAjPQ, cos fl = PCjPQ, and 
cos y = PD/PQ or 

cos a = Fi/Fl 

cos/? = V 2 /v\ (9) 

cosy = F 3 /fJ 

Since the position of Q relative to P is uniquely determined by only 
3 quantities (viz. V x , V 2 , and F 3 ) and 4 quantities have been introduced 
(viz. A, ft, v, and F) there must exist some relationship between these 
quantities. Using equations (7), (8), and (9) leads to 

A 2 + p + = i 

or cos 2 a + cos 2 /? + cos 2 y = 1 (10) 

Instead of the actual direction cosines it is often sufficient to use any 
three quantities proportional to the direction cosines. Let these be 
/, m, and n so that 

/ = kk\ 

m = k/t\ (11) 

n = kv) 

The quantities defined by (11) are called direction ratios and are often 
written in a square bracket [l:m:n] or [/, m, n\. In this notation, [1, 2, 3] 
would be the direction ratios of a vector whereas (1, 2, 3) would be 
the Cartesian coordinates of a point in space. 

From (10) and (11) 

k = V(/ a + m 2 + « 2 ) (12) 

and hence 

A = / / V(? 2 + m 2 + n 2 ) 
fi = m/y/(l 2 + m 2 + n 2 ) 
v = nl V(/ 2 + m 2 + n 2 ). 
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To obtain the direction cosines from the direction ratios it is therefore 
necessary to divide by \/(l 2 + m 2 + n 2 ). In particular, if 
V = VJ + V 2 j + F 3 k 

then [Kj: V 2 - V 3 ] may be used as direction ratios. 

Example (i) Find the sum of the two vectors 3i + 7j — 4k and 
6i — 2j + 12k and calculate its magnitude and direction. 

Solution Sum vector = R = 9i + 5j + 8k 

Magnitude = R = V(81 + 25 + 64) 

= V170. 

Direction ratios are [9:5:8] and the direction cosines are (9/V170, 
5/V170, 8/V170). 

Example (ii) Find the direction of the line PQ where P is the point 
(3, —7, 8) and Q is the point (2, 5, 5). 

Solution If O is the origin then in vector form 
OP = 3i - 7j + 8k 
and OQ = 2i + 5j + 5k 

and hence PQ = OQ — OP 

= -i + 12j - 3k. 

The direction cosines of the line PQ are then 

(-1/V154, 12/V154, -3/V154). 

Example (iii) Three forces F x = 2i + 3j + 2k, F 2 = 2j — 3k, and 
F 3 = 4i — 2j — k act on a point mass. Find the resultant force. 
Calculate the angles it makes with the coordinate axes. 

Solution Let the forces act on the point mass at the point P in Fig. 9.14. 



Fig. P 9.14 


The resultant R = F x + F 2 + F 3 
= 6i + 3j - 2k. 

The magnitude of R is \/(36 + 9 + 4) or 7. 



198 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 
The direction cosines are (6/7, 3/7, —2/7) and the direction angles are 
« = cos- 1 (6/7) =31°, 

P = cos' 1 (3/7) =64° 37', 

and y = cos -1 (—2/7) = 106° 36'. 

(Note here the value of y corresponding to the negative sign for cos y.) 

Example (iv) Prove that the centroid of any triangle divides the median 
in the ratio 2:1. 



Solution In Fig. 9.15 let M and N be the mid-points of AC and AB 
respectively of triangle ABC and let G be the centroid. 

Let CA = b and CB = a,, then BA = b — a. BN = £(b — a), CM = |b. 
Let CG = aCN where a is a scalar, then 

CG = «(CB + BN) 

= «(a + Kb - a)) 

= ^(a + b). 

Let BG = pBM, then 

BG = /3(BC + CM) 

= /?(—a + K>). 

Now since BG = BC + CG = — a + CG, it follows that 
|8(—a + ib) = -a + ^ (a + b) = R (say). 

The vector components of R are unique and hence 
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from which it follows that 

« = P = *■ 

The result then follows immediately. 

9.3.7 The transformation matrix 

The transformation matrix of section 8.1 can now be given a geo¬ 
metrical significance. Suppose that P is a point that has Cartesian 
coordinates (b lf b 2 , b 3 ) when referred to a frame of axes Ox ± x 2 x 3 and 
has coordinates {B x , B 2 , B 3 ) when referred to a second frame Oy l y 2 y 3 
obtained from the first by a rotation about O as illustrated in Fig. 9.16. 



From sections 9.3.5 and 9.3.6 it follows that 

B x = sum of projections of b lt b 2 , and b 3 on to Oy x 
= b x cos (xjOjj) + b 2 cos {x 2 Oy^) + b 3 cos (x a Oyj) 

= A±bi + fiib 2 + v x b 3 

where [A 1; fi u j^] are the direction cosines of Oy x referred to the frame 
Ox 1 x 2 x 3 . Likewise 

B 2 — X 2 b 1 + fi 2 b 2 + v 2 b 3 
and B 3 = A 3 bj + fi 3 b 2 + v 3 b 3 

where [?. 2 , \x 2 , v 2 ] and [X 3 , fi 3 , v 3 ] are the direction cosines of Oy 2 and 
Oy 3 respectively. 
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In matrix notation 



In an exactly similar manner a relationship can be derived in the form 

© lh A 2 2 3 \ /BA 

= Ij«i H i«3 lUa) (14) 

Vi v 2 v 3 } \bJ 

where [X u X 2 , A 3 ] are the direction cosines of Ox 1 referred to the frame 
Oyxy 2 y 3 . In a more concise notation the equation (13) may be written 


or B = Ab 

where A = (« tf ) is the transformation matrix and 
a tf = cos (xjOyt). 

Similarly equation (14) may be written 
bi = 

or b = ^4'B 

where A' is the transpose of A. 



Thus the transformation matrix (a u ) represents the rotation of the 
frame of reference Ox x x 2 x 3 into the frame of reference Oy 1 y 2 y 3 and 
has elements which are direction cosines of one frame with respect to 
the other, i.e. it specifies the magnitude and direction of the rotation. 
Alternatively, if the frame OxiX 2 x 3 is fixed then (a tJ ) represents the 
rotation in space of the line OP through an angle d, to take up a 
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position OQ where P is the point whose coordinates are (b 1 , b 2 , b 3 ) 
and Q is the point whose coordinates are (B u B 2 , B 3 ) and OP = OQ. 
This is illustrated in Fig. 9.17. 

Now let x be a vector in the direction of OP, i.e. x = / b, and let y 
be a vector in the direction of OQ, i.e. y = sB, where t, s are scalars, 
then 

y = Cx 

where C = (c tj ) 

= 7 (*«,)• 

The matrix C contains all the information necessary to give the relative 
positions and magnitudes of the vectors x and y. 

Note that solving equation (13) for b leads to b = A _1 B and hence 
it follows that 


or AA' = I. 

Matrices such as A which satisfy this relationship are called orthogonal 
and have an important role in matrix application to engineering prob¬ 
lems, particularly in connection with those involving vibration. 

Note also that in suffix notation the equation y = Cx may be 
written 

Ji = CijXj 

where ( x } ) and (y t ) are column matrices of order (3x1) and (c w ) is 
necessarily a square matrix of order (3 x 3). 

9.4 VECTOR EQUATION OF A STRAIGHT LINE 

The direction of a straight line in space is determined by its direction 
cosines (or ratios) and its position is fixed in space if the coordinates 
of any point on the line are known. 



Fig. 9.18 
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Suppose a straight line has a direction specified by a constant vector 
b (i.e. in a Cartesian coordinate system by b = b x i + b 2 \ + b 3 k so that 
the line has direction ratios [b x , b 2 , Z> 3 ]) and passes through a fixed 
point A in space whose position vector relative to an origin O is a (i.e. 
in Cartesian coordinates a = aj + a 2 j + a 3 k so that A has coordinates 
(oj, a 2 , a 3 )). In Fig. 9.18 the points A and B are given (for a given a 
and b) and the straight line is obtained by producing the line segment 
AB in both directions. 

The equation of the line is obtained by stating mathematically the 
geometrical condition that the variable point P is constrained to lie on 
AB and on AB (or BA) produced. Let OP = r be the position vector 
of P, then in Fig. 9.18 

OP = r = OA + AP. (15) 


But, from section 9.3.4 

AP = /AB 

= /b (16) 


where / is some scalar. 

For fixed b, different values of t give different points along the line. 
From equations (15) and (16) the required equation is 

r = a + /b. (17) 

It is important to note that the vector equation (17) is independent of 
any system of coordinates. The scalar quantity t is a parameter that can 
range from — oo to +oo. If / > 1, P lies in AB produced; if / = 1, 
P coincides with B\ if 0 < / < 1, P lies on AB\ if / = 0, P coincides 
with A and if / < 0, P lies on BA produced. 

In a rectangular Cartesian coordinate system 
r = xj + x 2 \ + * 3 k, 
a = flii + a 2 j 4- a 3 k, 
and b = bji + b 2 j + b 3 k 

so that equation (17) becomes 

Aii + x 2 \ + x 3 k = (flji + a 2 j + a 3 k) + t(bA + b 2 j + Z> 3 k). 

From section 9.3.5 this is equivalent to three scalar equations 
= fli + tb x 

x 2 = a 2 + tb 2 (18) 

x 3 = a 3 + tb 3 . 

The set of equations (18) are called the parametric or freedom equations 
of the line. Rearranging (18) gives the usual form of constraint equation 
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of the line 


*1 — fli _ -* 2 — «2 _ -* 3 — a 3 _ 

bx ~ b 2 ~ b 3 ~ l 


(19) 


It is worth noting that if the line lies in the two-dimensional plane 
Ox x x 2 then a 3 = 0 and, since the line is at right angles to the x 3 -axis, 
b 3 = 0 so that equation (19) reduces to 



The equations (20) are equivalent to 


*i ~ Qi 
bi 


Xz-a* 

~bT~’ x 3 


= o. 


( 21 ) 


If (x lt x 2 , x 3 ) is replaced by (x, y, z) the equation (21) becomes 
y — a 2 = b 2 (x — a 1 )/b x , z — 0 
which is equivalent to the well-known form 
y = mx + c, z = 0. 

Example (i) Find the equation of the line passing through the two 
points R and S whose Cartesian coordinates are i?(3, 2, +1) and 



Solution The direction of the line is specified by the join of the two 
points. In the notation of Fig. 9.19 

a = 3i + 2j + k, 
r = xj + x 2 \ + x 3 k 

and c = 6i - 3j + 4k 

so that RS = b = c — a 

= 3i - 5j + 3k. 

The vector equation of the line is then in the form 

r = a + /b 

which, using equations (17) and (19), leads to 
*i-3 _ x 2 -2 _ x 3 - 1 
3 -5 ” 3 


(= 0 


( 22 ) 
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Note that the vector equation could have been written using the point 
S instead of the point R, the equation would then have taken the form 
r = c + ib 


where s was the parameter. The equation equivalent to (22) would then 
have been 


xi — 6 x 2 +3 x 3 — 4 , 

3 = ~^5~ = 3 = J 


(23) 


That equation (23) is the same as equation (22) is immediately evident 
if s is replaced by t — 1. 

Example (ii) Find the equation of the line through (0, 3, 2) and parallel 
to the join of (4, 3, 1) to (10, 3, —2). 

Solution The join (4, 3, 1) to (10, 3, —2) has direction ratios [6:0: —3]. 
Write a = 3j + 2k and b = 6i — 3k then the vector equation is 
r = a + fb which leads to the scalar equations 


Note that the zero in the direction ratio shows that the line is perpen¬ 
dicular to the axis of y. Each point of the line has a y-coordinate of 3. 


9.5 THE POSITION RATIO WITH REGARD TO TWO 
BASE POINTS 

Given two points A and B the position ratio gives the coordinates of 
the point P that divides the join of A to B internally in the ratio p :q. 



Fig. 9.20 


In Fig. 9.20 let O be the origin and let OA = a, OB = b be the position 
vectors of the given points A and B. Let P divide the join A to B in 
the ratio p.q then 


or 


APjPB — plq 
AP/AB = pKp + q) 


(24) 



In vector form, 
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AB = b - a 

and AP = {pj(p + q)}(b - a). 


The position of P is determined by the vector OP or r where 
r = OA + AP = a + {p/(p + q)}(b - a) 

= (pb + qa)/(p + q) (25) 


If a = xj + yj + Zik; b = x 2 i + y 2 j + z 2 k; and r = xi + y\ + zk 
then equation (25) leads to three scalar equations for the coordinates 
of P, namely, 


p + q ’ 


py 2 + qyi 


pz 2 + qz x 
p+q 


(26) 


If the join A to B is divided externally in the ratio p :q the equation (24) 
is replaced by 

AP/AB = p/(p - q) 


and equation (26) is replaced by a similar equation with (—q) instead 
of q. 


Example Find the coordinates of the point dividing the join of (1, 2, 3) 
to (5,7,8), (a) internally in the ratio 3:2, and (b) externally in the ratio 


Solution 

(a) With (*j, y u z : ) = (1,2, 3) and (x 2 , y 2 , z 2 ) = (5, 7, 8) the point has 
coordinates 


3.5 + 2. I 
*- 3 + 2 - 

3.7 + 2.2 
y ~ 3 + 2 

3.8 + 2.3 
Z 3 + 2 


= 12 
5 

= 5 
= 6 


(b) Using p:q = 5: — 7 gives the coordinates (—9, —21/2, —19/2). 


9.6 PRODUCT OF TWO VECTORS 

So far only the addition of vectors and the multiplication by a scalar 
have been considered, but any vector algebra is lacking unless some 
meaning can be attached to the product of vector quantities. It is 
evident that the ordinary ideas of (scalar) multiplication cannot be 
sufficient because of the directional properties possessed by vectors. In 
order to give a motivation for the subsequent definition of the product 
of two vectors, consider now the multiplication of a force vector F and a 
displacement vector d. Suppose for simplicity that the force F is constant 
(i.e. constant in both magnitude and direction) and that it acts at a 
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point P as in Fig. 9.21. Consider the work done by the force as the point 
P moves to a point Q where the displacement vector PQ is denoted by d. 
Let the angle between F and d be 0, then the work done by F is defined as 
the product of the displacement and the force component in the direc¬ 
tion of the displacement, i.e. (F cos 6)d. This scalar quantity Fd cos 0 
has been arrived at by forming, in a particular way, the ‘product’ of the 
two vectors F and d. 

As a second illustration of the product of two vectors, suppose that 
the force F acts at one end Q of a rigid rod PQ whose other end P is 
freely pivoted. Consider the moment of the force about the point P. In 
Fig. 9.21 the magnitude of this moment is defined as the product of the 
force and the length of the perpendicular from P to the line of action of 
F, i.e. F(d sin 0). The moment of the force has, however, a direction 
associated with it as well as a magnitude and is a vector quantity. The 
direction of the moment of F about P is perpendicular to the plane of 



Fig. 9.21 

F and d and in the sense of the longitudinal movement of a right- 
handed screw laid in that direction and rotated in the direction that F 
is attempting to turn PQ about P. In Fig. 9.21 if F and d are in the 
plane of the paper then the moment about P is perpendicular to the 
paper and directed away from the reader. Hence it is possible to arrive 
at a vector quantity by forming, in a particular way, the ‘product’ of 
two vectors F and d. 

9.6.1 The scalar product 

Given two vectors a and b the scalar product (sometimes called the dot 
product) is defined as the scalar quantity ab cos 0, where 0 is the angle 
between the two vectors and is usually written a . b, i.e. 

a . b = ab cos 0 
= a(b cos 0) 

= b(a cos 0) 

= b. a. 


(27) 
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Note that if a and b do not lie in the same plane then 0 is the angle 
between two vectors parallel to a and b respectively drawn through 
any given point. The scalar product may be positive or negative 
depending on the angle 0. If 0 = 0 the vectors are parallel and 
a . b = ab, in particular a . a = a 2 = a 2 . 

The scalar product may be used to find the projection of a vector 
in any given direction. In Fig. 9.22 the scalar projection of the vector 
b in the direction of a is PN, and from equation (27) 



b. n 

Likewise the projection of b in any direction n is-. Note particu¬ 

larly that b and n do not have to be coplanar vectors. The scalar product 
obeys the following laws of algebra: 


The commutative law, 

a 

. b = b . a 

(29) 

The associative law, 

a(a. 

b) = oca. b = a . ab 

(30) 

The distributive law, a , 

> (b + 

c) = a . b + a . c. 

(31) 


If equation (31) is divided by a then the distributive law states that the 
component of (b + c) in the direction of a is equal to the sum of 
the component of b in the direction of a and the component of c in the 
direction of a. The distributive law may be extended in the form 

(a + b).(c + d) = a.c + a.d + b.c + b.d. (32) 

From equation (27) it is easily seen that the scalar product a . b vanishes 
not only when either a or b is zero but also when cos 0 is zero. Thus, 
two non-zero vectors whose scalar product vanishes must be perpen¬ 
dicular. In particular if i, j, and k are the unit vectors along the ortho¬ 
gonal Cartesian coordinate axes then 

i.i = j.j = k.k = l (33) 

and i. j = i. k = j . k = 0. (34) 

It follows immediately, using equations (32), (33), and (34), that if 
a = aji + a 2 j + u 3 k and b = + b 2 j + b 3 k, then 

a . b = (ad + a 2 j + a 3 k) . (bj + b 2 j + b 3 k) 

= ajhi + a 2 b 2 + a 3 b 3 . 


(35) 
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Example (i) Find the scalar product of a = 2i — 3j + 7k and 
b = 3i + j — 5k. 

Solution Here, 

a.b = 6 — 3 — 35 

using equation (35), i.e. 

a . b = -32. 

Example (ii) Find the projection of a = 2i — 3j + k on to b = i + 5k. 
Solution From equation (28) the projection of a in the direction of b 

7 

V26 

a = 2i — 3j + k and b = 

3i + 5j — 2k. 

Solution From equation (27), 

cos 0 — a. b lab 

= (6 - 15 - 2)/(V14)(V38) 

= -0.4768 

and 0 = 118° 29' = 180* - 61° 31'. 

Note that in any given problem it may be sufficient to find the acute 
angle between the two vectors, i.e. to ignore any negative sign. Strictly 
the angle of 61° 31' in the example above is the angle between a and — b. 

Example (iv) The point of application of the force lOi + 7j + 8k 
newton moves a distance of 3 metre in the direction of (a) the vector 
2i + 5j + 3k, and (b) the vector i + 2j — 3k. Find the work done by 
the force in each case. 

Solution Let the force be F and the displacement be d then the work 
done in each case is F . d. 

(a) The unit vector in the direction of 2i + 5j + 3k is 
(2i + 5j + 3k)/V38, therefore 

d = 3(2i + 5j + 3k)/V38 
and the work done by the force is 

W = F. d = (lOi + 7j + 8k). (2i + 5j + 3k)(3/y'38) 

= (20 + 35 +'24)3/V38 
= 237/V38 joule 

(b) The displacement vector in this case is 

d = 3(i + 2j - 3k)/V14 

and the work done is 

W = F . d = (10 + 14 - 24X3/ V14) 

= 0 joule. 


V(1 + 25) 

Example (iii) Find the angle between 
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No work is done by the force when it moves in a direction at right 
angles to its line of action. 

Example (v) Show that the angle in a semi-circle is a right angle. 

Solution It is required to show that in Fig. 9.23 AP and BP are at 
right angles for any position P on the given circle centre O, diameter 
AB. 



Write OB = a and OP = b, then AO = a and AP = AO + OP 
= a + b, and BP = BO + OP = -a + b. 

The scalar product 

AP . BP = (a + b). (-a + b) 

= —a . a + b . b 
= b 2 - a 2 

= 0, since OP — OB. 

The scalar product is zero and since AP and BP are non-zero they are 
at right angles. 

9.6.2 The vector product 

Given two vectors a and b the vector product (sometimes called the 
cross product) is another vector c written c = a x b such that 

(i) c is at right angles to both a and b, 

(ii) |c| = ab sin 6, where 0 is that angle between a and b such that 
0 < 0 < 77 , 

(iii) a right-handed rotation about c through an angle 0 would move a 
into the direction of b. 

The vector product may also be written 

a x b = (ab sin 0)n (36) 

where n is a unit vector, ]n] = 1, in the direction of c (Fig. 9.24). 
These properties imply that the vector b x a is identical with — c, or 
b x a = -(a x b). (37) 

The vector product does not obey the commutative law of algebra and 
it is essential in any manipulation of vectors involving the cross product 
that the order of the vectors is strictly maintained. 
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The vector product a X b is zero when either of a, b is zero or when 
sin 0 = 0. Thus two non-zero vectors have a zero vector product when 
they are parallel. 

If a and b are at right angles then a x b = abn. The base unit vectors 
i, j, k, of a right-handed Cartesian coordinate system are mutually 
orthogonal and have the following properties 


i x j 

= -j x i = k 

(38) 

j X k 

= -k x j = i 

(39) 

k x i 

= -i x k = j 

(40) 

i x i 

= jxj = kxk = 0. 

(41) 


Although the vector product does not obey the commutative law it 
may readily be seen that it can be associated with a scalar a in the form 
a(a xb) = (aa) x b = a X (ocb). (42) 



The vector product satisfies the distributive law in the form 

ax(b + c) = axb + axc (43) 

but the proof of this law is not obvious. (For a geometrical proof the 
reader is advised to consult a book devoted to vector analysis.) 

The Cartesian form of a x b may be derived as follows. Suppose c 
of Fig. 9.24 represents a X b and the Cartesian forms of the vectors are 
a = aj + a 2 j + « 3 k 
b = bji + b 2 j + b 3 k 

with c = a x b = ab sin On = Cji + c 2 j + c 3 k. (44) 

Thus c = (flji + a 2 j + <z 3 k) x (b\i + b 2 j + 6 3 k) 

= a x i x (hii + b 2 j + b 3 k) 'j 
+ a 2 j x (b 1 i + b 2 j + b 3 k) 1 

+ a 3 k X (bj + b 2 j + 0 3 k)J 

using the distributive law (43). 


(45) 
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A further application of (43) and the use of equations (38)—(41) leads 
to 

c = a X b = (a 2 b 3 — a 3 b 2 )i + {a 3 b 1 — a L b 3 )\ + {aj ) 2 — a 2 b x )k (46) 
or, in determinant form, 

I i j k I 

a X b = «! a 2 a 3 . (47) 

I h b 2 b 3 | 

Example (i) Find the vector product of a = 3i + 2j — k and 
b = i - 4j - 2k. 

Solution Here, 

I i j k I 

a x b = 3 2 -1 =-8i + 5j-14k. 

| 1 -4 -2 | 

Example (ii) Find the equation of the straight line passing through the 
point (3, 4, 5) having a direction perpendicular to the lines whose 
direction ratios are [1:0:2] and [—1:2:3]. 

Solution The direction of the line is to be perpendicular to the vectors 
a = i + 2k and b = — i + 2j + 3k. This direction is parallel to the 
vector 

I i j k | 

a X b = 1 0 2 = -4i - 5j + 2k. 

| -1 2 3 | 

The equation of the required line is given in vector form by 
r = (3i + 4j + 5k) + /(—4i - 5j + 2k) 
where r = xi + y\ + zk, or in scalar form by the equations 

x — 3_ y — 4 _ z — 5 

—5 ~ 2 


9.6.3 The matrix representation of a X b 
Let a x b = c = c,e { 


then the components of a, b, and c all form (1 x 3) row matrices. It 
is of interest to note that the vector product may be expressed in a 
matrix notation by 
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This may be further reduced into a concise matrix notation given by 

C i — e ifk a )bk 

in which i,j, and k are suffixes that take all possible values 1, 2, and 3 
and e ijk is defined as follows, 

e ijk = 0, if any two suffixes have the same value, 

= +1, if the suffixes have the cyclic order 123, 231, or 312, 

— —1, if the suffixes have the acyclic order 132, 321, or 213. 
For example, 

c 2 = e 2jk a } b k (summed over all values of j, k ) 

= e 231 a 3 b x + e 213 a x b 3 + terms in which e 2jk is zero 
= (+l)a 3 b x + (~l)a x b 3 
= a 3 b x — a x b 3 . 

Note that the suffixes i,j, and k are simply symbols that take the values 
1, 2, or 3 and have no connection with unit vectors i, j, and k. 

9.7 THE PLANE AND THE STRAIGHT LINE 

A plane is a surface in space for which the join of any pair of points 
of the surface is a straight line lying entirely in the surface and perpen¬ 
dicular to a given direction known as the normal to the plane. 



Suppose, as in Fig. 9.25, that A lies in the plane and that the position 
vector of A is OA = a. Let P be a variable point such that OP = r. 
Let c be the vector giving the normal direction, then the join of A to P 
lies entirely in the plane passing through A if AP is perpendicular to c, 

1C ' AP . c = 0 

or (r — a) . c = 0. 


(48) 
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Equation (48) is then the equation of the plane and is a scalar expres¬ 
sion. 

In Cartesian form, if 

a = aj 4- a 2 j + a 3 k, 
c = li + mj + «k, 
and r = xi + y\ + zk, 

then the plane equation has the form 

l(x — a x ) + m(y — a 2 ) + n{z — a 3 ) — 0 (49) 

From equation (48), the equation of any plane whose normal is in the 
direction of c has the form 

r . c = /? (50) 

or lx + my + nz — /?. (51) 

In another notation the plane equation has the form 

« 11*1 + « 12*2 + « 13*3 = bi 

and is a linear, algebraic equation of the type discussed in Chapter 8. 
Geometrically, a plane in space will usually meet a second plane in a 
common line of intersection. This line will then meet a third plane in 
a single point. Thus three planes will in general meet in a single point 
of intersection. The discussion of section 8.7 concerning the solution of 
n linear, algebraic equations in n unknowns has a simple geometrical 
interpretation when n = 3. The regular solution of section 8.7.1 is 
equivalent to the case of three planes meeting in a unique point of 
intersection. The equation (54) of section 8.7.1 is simply the case of 
three planes through the origin. The inconsistent case of section 8.7.2 
is geometrically equivalent to the case of three planes whose three lines 
of intersection form the edges of a triangular prism, i.e. the lines are 
parallel. The indeterminate case of section 8.7.2 is equivalent to the 
three planes having a common line of intersection. 

Many problems concerning the straight line and the plane can be 
dealt with using vectors and a few problems are solved below. 

Example (i) Find the equation of the family of planes whose normals 
are in the direction of the vector 2i — 3j + k. Find that member of 
the family passing through the point (3, —1, 2). 

Solution From equation (51) the family equation is 
2x — 3y + z = P 

The particular plane through the point (3, —1, 2) has the value of ft 
given by 


that is 


2(3) - 3(—1) + (2) = 

,3 = 11. 
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The required equation is then 

2x — 3y + z = 11. 

Example (ii) Find the equation of the plane containing the line 
X — 3 y — 1 __ z + 1 
2 “ 1 - 3 ’ 

and parallel to the line 

x-l y+2 z+4 

1 ~ 3 “ -1 ' 

Solution The plane must pass through the point (3, 1,-1) and have 
its normal perpendicular to the directions [2:1:3] and [1:3: — 1]. If the 
normal vector is c then 

I » j k I 

c = 2 1 3 = -lOi + 5j + 5k 

Jj 1 3 -1 | 

and the equation of the plane is 

-10(x - 3) + 5(> - 1) + 5(z + 1) = 0 
or 2x - y - z = 6. 

(Note that the point (1, —2, —4) is not used in determining the equa¬ 
tion.) 


Example (iii) Find the coordinates of the point of intersection of the 
line r = a + fb with the plane r . c = ft. 


Solution A straight line meets a plane in one point, in general. The 
point will be determined when the line and plane have a common 
position vector, i.e. when 


(a + rb) . c = ft 
or a . c + r(b . c) = ft. 

The value of t is then given by 


ft — a.c 
b. c 


if b . c ^ 0, 


and substitution into the equation r = a + tb gives the required 
position vector. 

If b . c = 0 the line is perpendicular to the normal, i.e. parallel to the 
plane, and there is no point of intersection (except at infinity) unless 
ft also has the value a . c (which implies that a lies on the plane) and 
then t is indeterminate, the line lies in the plane, and every point of the 
line is a solution. 
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Example (iv) Show that the lines 



x+9 y-6 z + 4 

4 -2 1 


intersect and find the coordinates of the point of intersection. 

Solution Usually two lines in space are skew (i.e. they do not intersect) 
and the condition for intersection is simply that they have a point in 


Let the parameter of the first line be t and that of the second line 
be s, then the condition that there is a common point is the condition 
that the equations 

5 + 2t = -9 + 4s 

1 + t= 6-2s 

2 + 3t = —4 + s 

are consistent. The condition for consistency from equation (56) of 
section 8.8.2 is that 

I 14 2 4 1 

-5 1 -2 =0. 

| 6 3 1 | 

The reader can easily verify that this is so and also that t = — 1 (or 
s = 3). Using the first equation with t = — 1, the intersection point is 
given by x = 3, y = 0, and z = —1. 

Example (v) Find the shortest distance between the lines 

x-3 _y+l_z-4 ^ x + 4 _ y + 2 _z -1 



Solution In Fig. 9.26 the shortest distance is given by RS the common 
perpendicular. The direction of the perpendicular is given by n where 
n = b x B, and b and B are vectors in the direction of the lines. 
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Let P and Q be points on the lines with position vectors a and A 
respectively, then the length RS is the projection of PQ on to n. Since 
PQ = A — a then 


(A-a).n 
n 

Here, a = 3i - j + 4k, b = 2i + j + k, 
B = 3i — j — 3k, hence 


RS = 


A = 


—4i - 2j + k, 


n = 


The shortest distance is 


I * j k I 

b X B= 2 1 1 

I 3 -1 -3 I 

= —2i + 9j - 5k. 


and 


(_7i - j - 3k) . (-2! + 9j - 5k) 
V(4 + 81 + 25) 


= 20 /^/ 110 . 


9.8 MOMENT OF A VECTOR ABOUT A POINT 

If a force acts on a particle at a point Q then the moment of the force 
about a point P is d x F where d is the position vector of Q relative to 
P as in Fig. 9.27. The direction of the moment is perpendicular to the 
plane of d and F and in the right-handed sense. The magnitude of the 
moment is Fd sin 0. 



Example A force of magnitude 4 newton acts at the point (3, 1,-4) 
parallel to a line whose direction ratios are [—1:1: —1J. Find the 
moment of the force about the point (1, 0, —2). Distance units are in 
metres. 


Solution Write a = i — 2k, b = 3i + j — 4k. The force vector is 
F = 4(—i + j — k)/y3 newton. The moment M is given by 
M = (b - a) x F 

= (2i + j - 2k) x (-i + j - k)(4/V3) 



= 4(i + 4j + 3k)/y3 newton metre. 
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It may be noted that M has three components (4/y'3), (16/V3), and 
(12/\/3) respectively, in the directions of the coordinate axes. 

The concept of the moment of a force about a point can be extended 
to any vector quantity although the extension may not always have any 
physical significance. For example, if v is the velocity vector of a 
particle of mass m at the point P and r is the position vector OP then 
mv is the linear momentum vector and r x mv is the moment of 
momentum (or angular momentum) of the particle about the origin O. 
This quantity is fundamental in mechanics. 

9.9 THE PRODUCT OF THREE VECTORS 

The product of two vectors is either a scalar or a vector quantity and 
in each case may be multiplied by a third vector. 

Given two vectors b and c, the product b. c is a scalar, A say, and 
may be multiplied by the third vector a to give (b . c)a or Aa which is 
simply a vector of length Xa in the direction of a. The product of a 
with the vector product b x c is of more interest and has many applica¬ 
tions in mechanics and other fields. 

9.9.1 The triple scalar product 

The product of a with b x c may be evaluated in the form a . (b x c) 
and the result, being a scalar, is called the triple scalar product of a, b, 
and c. Let a, b, and c be drawn at a point P and the parallelepiped 
constructed as in Fig. 9.28. 



Let 6 be the angle between b and c, and </> the angle between a and the 
normal to the plane containing b and c. By definition 
b x c = (be sin 0)n 

where n is a unit vector in the direction of the normal to the plane 
containing b and c in the right-handed sense with respect to b and c. 
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The triple scalar product is given by 

a . (b x c) = a . {be sin 0)n 
= {be sin 0)(a . n) 

= {be sin 6){a cos <f>) 

= {be sin 6)PN 

where PN is the ‘height’ of the parallelepiped. Since be sin 0 is the area 
of the ‘base’ parallelogram PBDC, the triple scalar product may be 
represented as the volume of the parallelepiped with sides a, b, and c. 
If 

a = a x \ + a 2 j + a 3 k, b = bj + b 2 j + b 3 k and c = cj + c 2 j + c 3 k, 
then 

I i i k I 

bxc = U 1 b 2 b 3 I (52) 

I Ci c 2 c 3 | 

and 

a.(bxc) = (o 1 i + a 2 \ + a 3 k) . {Aj + A 2 j + ^ 3 k) 

where A lt A 2 , and A 3 are the cofactors of I, j, and k in determinant (52). 
Evaluating, 

I «i <*2 o 3 | 

a . (b X c) = b 2 b 3 . (53) 

I Cl Cj c 3 | 

From the properties of interchange of rows in the determinant (53) 
it follows that 

a. (b X c) = b . (c x a) = c. (a x b). (54) 

The commutative property of the scalar product shows that 

a . (b X c) = (a x b) . c. (55) 

The brackets in (55) are frequently omitted and then 
a.bxc = axb.c 

which shows that the . and x of the triple scalar product are com¬ 
mutative. Notations in common use are (a, b, c) or [a, b, c] or simply 
abc. Although the brackets are frequently omitted, the reader is advised 
to retain brackets until confidence has been gained in vector manipula¬ 
tion. Note that a quantity such as (a . b) x c is meaningless because 
(a . b) is a scalar quantity. 

From the property of the determinant (53) 

(a, b, c) = —(a, c, b). 
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The triple scalar product vanishes 

(i) if any of the vectors is zero, 

(ii) if, in Fig. 9.28, cos <f> = 0, i.e. if <j> = -rr/2, 

(iii) if any one vector is a scalar multiple of either of the other vectors. 
This follows immediately from the property concerning equal rows 
in the determinant (53). 

From (iii) the triple scalar product vanishes if any two of the vectors 
are parallel. From (ii) the necessary condition that three non-parallel 
vectors are coplanar is that the triple scalar product vanishes. 

Example (i) Find the triple scalar product of the vectors 2i — 7j + 3k, 
i — j, and 3i + 4k. 

Solution The triple scalar product is 
12-7 3 1 

1 _l 0 = 3(3) + 4(5) = 29. 

| 3 0 4 | 

Example (ii) Find the condition that the two lines r = a + rb and 
r = A + jB intersect. 

Solution An inspection of Fig. 9.26 shows that the lines will intersect if 
the vectors (a — A), b, and B are coplanar. The condition for this is 
simply 

(a — A) . (b X B) = 0 

which is 

I Q\—Ai bi Bi I 

a 2 —A 2 b 2 B 2 = 0. 

I ci 2 — A 2 b 2 B 2 | 

9.9.2 The triple vector product 

The product of a with b X c may be evaluated in the form a X (b x c) 
and the result, being a vector, is called the triple vector product of 
a, b, and c. With reference to Fig. 9.28, 

b x c = (be sin 0)n 

and a x (b X c) = (be sin 0)(a x n) 

= (be sin 0)(a sin <f >)m (56) 

where m is a unit vector perpendicular to a and n in the right-handed 
sense. This vector m, and hence a x (b x c), is in the plane of b and c 
and from Fig. 9.11, with a a null vector and OR in the plane of b and c, 
it follows that 


a X (b x c) = pb + vc 
where p and v are scalars. 


( 57 ) 




220 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 
Likewise, 

(a x b) x c = -c x (a x b) 

= aa + £b (58) 

where a and (5 are scalars. 

The results (57) and (58) show that, in general, a x (b X c) and 
(a x b) x c are different vectors. The position of the bracket in a 
triple vector product is vital to the definition and must be retained. The 
quantity a X b x c is not fully defined. 

To find the value of p, and v of equation (57) first form the scalar 
product of both sides with the vector a, i.e. 

a . {a x (b X c)} = p(a . b) + v(a . c) (59) 

The left side of (59) is a triple scalar product [a, a, (b x c)] with two 
equal vectors and is therefore zero. Hence, 

£-ris-'<“*> <“> 

and substitution in (57) gives 

a x (b x c) = /{(a. c)b - (a. b)c}. (61) 

The equation (61) is to be true for all vectors a, b, and c. Suppose 
a = aj + a 2 j + a 3 k 
b = bj + b 2 j + b 3 k 
and c = c x i + c 2 j + c 3 k, 

then b X c = (b 2 c 3 b 3 c 2 )i + ( b 3 c± — bjC 3 )j + ( b±c 2 — b 2 Ci)k 
and the component of a x (b x c) in the x-direction is 

a 4t>\C 2 — b 2 C\) — a 3 (b 3 ci — biC 3 ). (62) 

The component of (a. c)b — (a, b)c in the x-direction is 

(a 1 c l + a 2 c 2 + aaC^bx — (aA + a 2 b 2 + aab^Cj, (63) 
Comparison of (62) and (63) shows that in (61) 

t = 1 

and that finally 

a x (b x c) = (a . c)b — (a . b)c. (64) 

Example (i) Verify the result (64) if a = 2i — j + k, b = i — j, and 
c = i + j + k. 

Solution Here, 

I 1 j k I 

b x c = 1 -1 0 =-i-j + 2k. 

1 1 1 I 
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The left side of (64) is 

I * j k I 

a x (b x c) = 2-1 1 = -1 - 5j - 3k. 

|- 1-1 2 | 


With (a . c) = 2 and (a . b) = 3, the right side of (64) is 
(a . c)b - (a . b)c = 2(i - j) - 3(i + j + k) 
= — i — 5j — 3k 


which is identical with the left side. 


Example (ii) Given x x a = b and x . c = a, find x. 

Solution The triple vector product of c, x, and a is, from (64) 
c X (x X a) = (c. a)x — (c. x)a 
= (a. c)x — oca. 


Put x x a = b then 


and so 


c x b = (a. c)x — aa 
(c X b) + aa 

X =-;-c-- 

(a.c) 


Example (iii) Prove that (a x b) . (c x d) = (a. c)(b. d) — (a . d)(b. c). 

Solution First consider (c x d) as a single vector V, say, then 
(a x b). (c x d) = (a x b).V 
= (a,b,V) 

= a . (b X V) 

= a. [b x (c x d)] 

= a . [(b . d)c - (b . c)d] 

= (a . c)(b. d) - (a . d)(b . c). 


9.10 DIFFERENTIATION OF VECTORS 

In most physical applications the vectors involved are not constants 
but are variable with respect to some scalar quantity. This scalar is 
frequently the time. For example the velocity v of a rocket being pro¬ 
jected from the Earth is clearly a function of the time since blast-off. 

9.10.1 Differentiation with respect to a scalar 

Suppose a is a vector which is a function of a scalar u, this is written 
a = f(n) or a(«). (65) 

Let u change to u + Su such that a changes to a + <5a where a + da 
is a (w + du). The derivative of a with respect to u is defined in the same 
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way as for the scalar functions of Chapter 2 by the limiting value of 
the ratio of (5a to du and written 


da 

du 


a (u + du) — a (u) 


( 66 ) 


The ratio of (5a to du is in the form of a scalar (l/du) multiplying a 
vector <5a and is a vector in the direction of da. In Fig. 9.29, OP repre¬ 
sents the vector a and OQ the vector a + da so that PQ represents the 
vector da. 

In general the vector 5a is in a different direction from that of the vector 
a. Write PQ = PQs where s is a unit vector in the direction of PQ, then 



Fig. 9.29 



the limit process of (66) involves both the limit of the magnitude ratio 
( PQ/du ) and the limit of the direction of the vector s. Let the latter be 
the unit vector q then 



that is, the derivative dajdu is in the direction of q. Note again that 
this direction is not necessarily in the direction of a. 

Consider the particular case when a is the position vector of a 
moving particle P. Replace a by r and u by the time t, then in Fig. 9.30 

OP = r = r(r) 

and 5r is the displacement PQ of two points on the path of the particle. 
The limit of PQjdt is the speed of the particle and the limiting direction 
specified by the unit vector q is that of the tangent to the space curve 
at P. But speed with an associated direction is simply the velocity; this 
is written 

dr . 

7, = r - , = v ' I' 


( 68 ) 
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Similarly, the derivative of the velocity may be evaluated to give the 
acceleration f, i.e. 


, d\ d 2 1 

f = dt = dT 2 = y = 


v(r + St) - v(t) 

; st 


(69) 


Using Fig. 9.29 with a replaced by v it is evident that the acceleration 
vector is not necessarily in the direction of the velocity vector. In 
general the vectors of position, velocity, and acceleration are in three 
different directions. 

The derivative of a vector has been defined in the same way as for 
a scalar function and the following results may be obtained 


Jd a+b > 

s* 

s; ( "- b) 

ad a * b) 
ad* x o x «)} 


da db 

du du 


(70) 

du ^ du 


(71) 

da db 

~t . b + a. -j- 

du du 


(72) 

(£ b .‘M-£' 


(73) 

da , db 

-r x b + a x -y 
du du 


(74) 

^x(bxc) + ax ( 

:s*<) + .x( 

hx|) 



(75) 


It is important that the order of the vectors be unchanged whenever a 
vector product is involved. 

Example (i) A particle moves so that its position at time t is given by 
r = 2 sin ti + 3 cos t\ + t 2 k where i, j, and k are constant base vectors. 
Show that 

r + r = (t 2 + 2)k. 

Solution If r = 2 sin ti + 3 cos tj + t 2 k 

then r = 2 cos ti — 3 sin t\ + 2tk 

and ¥ = —2 sin ti — 3 cos tj + 2k 

= -(r - t 2 k) + 2k. 

Hence r + r = (t 2 + 2)k. 

This result is a vector differential equation and represents three scalar 
equations of the type 

x + x — 0, y + y = 0, and z + z = (t 2 + 2). 
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Example (ii) What can be deduced about the acceleration of a particle 
that moves such that r x r is constant? 

Solution Let c be a constant vector such that 
r x r = c 

then, differentiating and using (74), 

rxr + rxf=0. 

Since r x ris zero, by definition, the acceleration is such that 
r x f = 0. 

Either the acceleration is zero for all time or the acceleration vector is 
parallel to the position vector, that is the acceleration is directed to¬ 
wards (or away from) the origin. The particle is therefore moving under 
the action of a central force. 


If, in equation (72), a = b then 


du 

- a 2 and it follows that 


(a . a) = 2a. - 


da _ da 
8 ' du~ a du 


(76) 


Hence if a is a vector of constant magnitude, i.e. |a| = a = a constant, 
then da/du = 0 and equation (76) shows that 

a . da/du = 0 (77) 

from which can be deduced the fact that a and da/du are perpendicular. 
Note that this is all that can be deduced from (77), as in general \da/du\ 
is not zero. In particular, da/du = 0 but \da/du\ ^ 0. 


9.10.2 Differentiation of a unit vector in two-dimensions 

Consider a unit vector p at the origin of coordinates O at time t. 
Suppose p makes an angle 0 with some given direction and that in a 
small time interval dt the vector takes up a new position at angle 
0 + 66 (Fig. 9.31). 


o 



o 


Fig. 9.31 
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Because |p| = |p + <5p[ = 1 = constant, then dpjdt is at right angles to 
p in the direction of the unit vector q. The magnitude of dpjdt is 


dp 

dt 


Lim 

Lim 

<sf—o 

Lim 


PQ 

dt 

2 sin (<50/2) 
dt 

sin (<50/2) <50 
(dO/2) dt 


dt 


and so 


dp 

dt 


(78) 


Any vector a may be written in the form ap where p is a unit vector 
in the direction of a. Differentiating and applying (71) and (78) leads to 


d a d 

dT = dtW 


(79) 


The equation (79) shows immediately that 



and that \da/dt\ and ( da/dt ) are different quantities, unless 6 = 0, i.e. 
unless a is in a constant direction. 

9.10.3 The velocity and acceleration components of a point moving in 
two-dimensions 

Let OP = r be the position vector of a moving point at time t. In many 
applications it is necessary to express the velocity and acceleration in 
Cartesian or polar coordinates. 

In Cartesian coordinates 

r = *i -f-j-n 

v = r = xi + j>j 1 (81) 

and f = v = f = xi + >’jj 

In polar coordinates, write 


r = rp 
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where p is a unit vector in the direction of r then from equation (79) 

• = t=(§)p + (^)q 

= rp + r0q. (82) 

The result (82) states that the velocity v may be resolved into two 
components at right angles, r in the radial direction and r6 in the 
transverse or cross-radial direction. These components together with 
the Cartesian components are illustrated in Fig. 9.32(a). 



Differentiating equation (82) again 

f = * = ? = rp + r ~ + r0q + r6 + r0q 

= rp + rd q + rd q + r $q + r& 2 (—p) 

on noting that df\fdt has a magnitude of 0 and a direction at right angles 
to p in the sense of increasing 6, i.e. in the direction of — p. 

Finally, 

f = (f - r6 2 ) p + (2 r6 + r6) q. (83) 

The two components of the acceleration are then r — rd 2 in the radial 
direction and 2 rd + rd or - — (r 2 &) in the transverse direction. These 

acceleration components are illustrated in Fig. 9.32(b). 

If the point P moves in a circle of radius a so that |r| = r = a the 
polar acceleration components are given by (—afl 2 ) in the radial 
direction and ad in the cross-radial direction. If further, the angular 
speed 6 is constant then the cross-radial acceleration component 
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vanishes and the radial component of (— ad 2 ) remains. Thus a point 
moving round a circle with a constant angular speed experiences an 
acceleration directed towards the centre of the circle, the so-called 
centripetal acceleration. 


9.11 EIGENVALUES AND EIGENVECTORS 
In 9.3.7 it was observed that the relationship y = Cx could be con¬ 
sidered as a transformation process, the vector x being transformed into 
the vector y by the matrix C. The two vectors are of the same order 
but are, in general, of different magnitudes and directions. The matrix 
C is necessarily square. 

Suppose that the matrix C is given. Then of all the possible trans¬ 
formations there exists a vector x which will be transformed into a 
scalar multiple of itself, that is, such that 

Cx = Ax (84 ) 

where A is a scalar. 

A particular transformation such as (84) is of interest in the mathema¬ 
tical sense but it is also of interest to the engineer when it is found that 
equations of this form play an important role in engineering problems 
(see section 17.3.3 of Chapter 17). 

The equation (84) may be written 

(C - A/)x = 0 (85) 

where / is the unit matrix of the same order as C. 

Suppose that x is represented as the column matrix fo, x 2 , x 3 ) T and 
C = (c w ), with i,j = 1,2, 3 then (85) is equivalent to the set of equations 

(cn — K)x x + c 12 x 2 + c 13 x 3 = 01 

c 2 i*i + (c 22 - X)x 2 + c 23 x 3 = 01 (86) 

Ca i*i + c 32 x 2 + (c 33 - X)x 3 = oj 

The system (86) has a non-trivial solution if the determinant of the 
coefficients is zero [see the end of section 8.7.2], 
i-e. |C — Al| = 0 (87) 

or 

c n ~ A c 12 c 13 I 

c 2 i C 22 - A c 23 j = 0 (87a) 

Tu c 32 c 33 - A | 

The equation (87) leads to a cubic equation in A. This means that, in 
general, there are three different values of A and hence three different 
vectors x for which (86) has a non-trivial solution, that is for which x 
is transformed into a scalar multiple of itself by the matrix C. The alert 
reader will observe that two of the roots of the cubic equation may be 
complex and will also note that it is possible that there are equal roots. 
It is assumed here that all the roots are real. 
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The values of A obtained by solving (87) are called the eigenvalues of 
the matrix C (sometimes called characteristic values or latent roots), 
and the associated vectors x are called the eigenvectors (sometimes 
characteristic functions). The equation (87) is called the characteristic 
equation. 

Example (i) Find the eigenvalues and associated eigenvectors if 

( 2-1 0 

-1 5 -1 

0-3 2 


Solution The eigenvalues are the roots of the equation 


2 — A -1 0 

-1 5 -A -1 
0 -3 2 -A 


that is, of 

(2 - 2)[(5 - A)(2 - A) - 3] + l[-2 + A] = 0 

The equation reduces to (2 — A) (A — 1) (A — 6) = 0 and the three 
eigenvalues are A = 1, 2, and 6. The associated eigenvectors are the 
vectors (x lt x 2 , x 3 ) T given by solving the equations 

(2 - A)x x - x 2 =0 

—+ (5 — A)x 2 — x 3 =0 
—3x 2 + (2 - A)x 3 = 0 


for the appropriate eigenvalue. 

If A = 1, then x x = x 2 and 3 jc 2 = x 3 and, choosing the value of x 2 
arbitrarily leads to the eigenvector 



If 2 = 2, then 

x 2 = 0 and —x x + 3x 2 — x 3 = 0 

which leads to x 2 = 0 and x x = —x 3 . The associated eigenvector is then 



— x 2 =0 

— x 2 — x 3 = 0 

—3x 2 —4jc 3 = 0 


where fi is arbitrary. 
If A = 6 then 


-4Xy 

- *i 



which leads to 4x x 
(letting Xl = y) is 


VECTORS 229 

~ x 2 > and 3x 2 = —4x 3 . The associated eigenvector 



where y is arbitrary. 

It is common practice to remove the arbitrariness of an eigenvector 
by making it of unit magnitude. Thus in the example above the three 
vectors would become, respectively 



The eigenvectors are then said to be normalised. 

In a geometrical sense the above example shows that the trans¬ 
formation matrix 



C = 

H :H) 


will transform the vector x (1) 

= I i/a/ii I lnto the vector y a) 

\Wn) by y<*> = x a > 

given 

or 

the vector x <2> 

/ 1/a/2\ . 

— In | into the vector y <2) 

U/W by y<2>=2x<2) 

given 

or 

the vector x <3> 

/ 1 /a/ 26 \ . 

= ( —4/^26) >nto the vector y' 3 > 
\ 3/V26/ b y y <3 = 6x <3) 

given 


Example (ii) Find the eigenvalues of the matrix 


2 0\ 
° 1 
-1 0 / 


and the associated eigenvectors x such that Cx = 2x. 
Solution The characteristic equation is 

— 1—2 2 0 I 

1-2 1=0 
1 -l -2 I 


which reduces to (2 + 1) 2 (2 — 1) = 0. There are three 
only two are different, i.e. 2 = 1 and 2 = -1 (twice). 


values of 2 but 
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If A = 1, the eigenvector is given by solving 
-2xx +2 x 2 = 0 

*i — x 2 +x 3 = 0 
*i — x 2 —x 3 = 0 

which leads to x 3 = 0 and x } = x 2 . The normalised eigenvector is 



If A = — 1, the equations for x reduce to 

2x 2 =0 

*i + x 2 +* 3 = 0 

*1 — *2 +*3 = 0 

which give x 2 = 0 and x x = —x a . There is only one eigenvector 
associated with the double root and this is 

( 

\-UV2/ 

in normalised form. 

In many practical problems the matrix C is symmetric (i.e. 
c tj = + Cji) and it may be shown (but not here) that in this case there 
is a distinct eigenvector for each eigenvalue whether repeated or not. 

Example (iii) Find the eigenvalues and the associated eigenvectors for 
the symmetric matrix 

C = 

Solution The equation for the eigenvalues is given by 
— 1—A 2 4 I 

2 2 - A -2 =0 

4 -2 —1—A I 

which reduces to the cubic equation (A — 3) 2 (A + 6) = 0. 

If A = —6, the eigenvector is given by 

5*! +2x 2 +4jc 3 = 0 

2xx +8 x 2 —2x 3 = 0 

4x x —2 x 2 +5;c 3 = 0 

Simple Gaussian elimination (see 8.6.1) leads to —= 2x 2 = x 3 , and 
the eigenvector is , ~ 
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in the normalised form. 

If X = 3, the eigenvector is given by solving 
—4x t +2x 2 +4x 3 = 0 
2xj — x 2 — 2jc 3 = 0 
4*! — 2x 3 —4x 3 = 0 

The three equations have a redundancy and are equivalent to the single 
equation 2x x — x 2 — 2x a = 0, hence two of the variables, say x x and 
x 3 , can be assigned arbitrary values /S and y, and the vector takes the 
form' 


In this form the eigenvector i: 
vectors 


>r V + tV 

i linear combination of the two eigen- 


It is an additional property of a symmetric matrix that the eigen¬ 
vectors are orthogonal, i.e. the scalar products of the vectors are zero. 
In Example (iii) the vectors (1, 2, 0) r and (0, -2, l) r are orthogonal to 
the eigenvectors (2, —1, — 2) T since 

(1 2 0) |-lj = 0 and (0-2 l)|-lj=0 

It follows that and y may be chosen to give any pair of eigenvectors 
which are orthogonal to each other and to the third eigenvector. For 
example, suppose the vector with = 1, y = 0, i.e. (1, 2, 0) r is called 
a and the vector associated with X = 6 is called b then the third vector 
perpendicular to the plane of these two may be called c and the three 
vectors are mutually orthogonal if c = a X b. 

Thus c may be written 

I * j k I 

C = a X b = I 1 2 0 


where i, j, and k are the usual orthogonal base vectors in the Cartesian 
frame Ox!X 2 x 3 . 
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The vector c then has the form 



and it may be verified that 



are mutually orthogonal. 

9.12 THE GRADIENT OF A SCALAR FIELD 

A scalar function of position (and possibly time) V(x, y, z, t) together 
with the region over which the function is defined is called a scalar 
field. Examples of such fields are the temperature T in a heated region 
or the electrical potential in an electrical conducting medium. 

9.12.1 Two-dimensional scalar field 

A two-dimensional scalar field may be conveniently represented geo¬ 
metrically by drawing level curves 

V(x, y, t) = c = constant 

For example, the Fig. 9.33 shows a possible set of equi-temperature 
curves across a rectangular region heated on two sides and held at 


T = T 2 



T = 0 
Fig. 9.33 


zero temperature on the other two. If the temperatures 7) and T 2 
are variable in time then the picture of level curves will be different for 
different times. The scalar, i.e. the temperature, is then called unsteady. 
However if the scalar function is independent of time there is only one 
level curve picture and the field is called steady. If the rectangular 
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region of Fig. 9.33 was initially at zero temperature and the two sides 
re suddenly raised to, and maintained at, temperatures T, and T, 
constant with respect to time), then the temperature distribution over 
the region would be time dependent long enough for the heat to flow 
over the entire region. After a sufficiently long period of time the dis¬ 
tribution would become steady and the level curve picture would be 
the same for all subsequent time. 

Suppose that V(x, y) is a steady two-dimensional scalar field and P 
is any point on the level curve (see Fig. 9.34) V(x,y) = k = constant. 
Let N be an arbitrary point (x + 3x,y + 3y) on an adjacent level curve 
corresponding to V + 3V and let PN be at angle 0 to the axis of x. 
it sis a spatial distance measured in the direction PN such that PN = 
j. at 'f hlCh Kis chan g in g 'n the direction of PN is given by 

Limit (SV/ds) as A,-* 0> i.e. by dVfds. This is called the directional 



derivative of V (in the direction PN). Using equation (26) of 5.6.2 and 
noting that V is a function of x and y, as well as a function of j. 


W^dVdx dVdy 
ds ~ 8x ds + ~8y ds 
8V 8V 

= te COs6+ ty sind 
= a cos 6 + b sin d 


( 88 ) 


= vV + b 2 ) sin (0 + a) (89) 
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in which, tan a = ajb, and a = dV/dx, b = dV/dy are the rates of 
change of Fin the directions of x and y respectively and are independent 
of 6. 

From (89), it follows that the numerically least value of dVjds is zero 
and that it occurs when 6 — 0 mln such that 


tan 



dVldx 

8V/dy' 


(90) 

(91) 


But if F(x, y) — k = constant, it follows (see section 5.6.2) that 

dy__ dVjbx 

dx ~ dV/dy 


(92) 


and comparison of (91) and (92) shows that 0 ml „ is the angle of slope 
of the tangent to the level curve. Hence the minimum value of the 
directional derivative is zero and occurs when measured in the direction 
tangential to the level curve at P. The maximum value of dV/ds is 
V(a 2 + b 2 ), i.e. 

V({ BVldx } 2 + {dV/dy} 2 ) (93) 


and occurs when 0 = 0 max such that 

0 max + «■ = %* (94) 

Thus the directional derivative is a maximum when taken in the direc¬ 
tion perpendicular to the level curve, i.e. in the direction of the normal 
to the curve and in the sense of increasing F. 

This maximum value of directional derivative together with its 
direction perpendicular to the level curve form a vector quantity which 
is called the gradient of F and written grad F or in symbolic form VF 
Thus grad F is a vector of magnitude y/{{8Vldx) 2 + (dV/dy) 2 } and of 
direction specified by 0 m ax» i- e - by the slope 


tan 0 max = cot a 
— b/a 

_ dV i 8 y 

~ SV/dx 


The vector grad F thus has components in the x and y directions given 
by 8V18x and dVjdy respectively. It follows immediately that 


8V 8V . 
grad F=VF=^. + ^J 


(95) 


where i and j are the usual unit vectors in the x and y directions respec- 
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tively. In the alternative notation 


8V 
~ 8 Xi 

using the summation convention on the symbol /. 


(96) 


9.12.2 Three-dimensional scalar field 

The definition of grad V in (95) and (96) may be extended immediately 
to a three-dimensional field, i.e. 


or alternatively 




grad V = 


(97) 

(98) 


In this case the geometrical representation of the field requires a three- 
dimensional picture of level surfaces V(x, y, z) = constant. The vector 
grad V is then of magnitude 

V{(SV/8x) 2 + (8 V/8y) 2 + (dV/dz) 2 } 

at the point P and in the direction of the normal to the surface (i.e. 
perpendicular to the tangent plane at P). 

Note The symbol V is pronounced del (sometimes nabla) and is a 
vector operator, i.e. an operator which when applied to V gives the 
vector grad V. It is often written 


_ 8,8 8 

V = ^ , + ^J + S k 


8 


Example (i) Find grad V if V = x 3 y + 3xy 2 z — 2x 2 z 2 
Solution Here 


= ( 3 * 2 / + 3y 2 z - 4xz 2 )i + (x 3 + 6xyz)\ + (3xy 2 - 4x 2 z)k 


Example (ii) Find the equation of the normal to the ellipsoid 
2x 2 + 3j 2 + z 2 = 21 at the point (3,-1, 0). 


Solution Let V(x, y, z) = 2x 2 + 3 y 2 + z 2 , for all x, y, z, be a scalar 
field, then grad V is perpendicular to the level surface V(x, y, z) = 21, 
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i.e. is in the direction of the normal to the surface of the ellipsoid. Now, 
grad V = 4jc i + 6y j + 2z k 
= 121 —6j + 0k 

at the point (3,-1, 0). The equation of the required normal is 
x-3 y±_ 1 x 

12 -6 0 

The normal is the line 2 y + x = 1, z = 0, i.e. the line 2y + x = 1 in 
the (x, y) plane. 

Example (iii) A particle at the point P (x, y, z ) is subjected to a force 
F whose magnitude is inversely proportional to the square of the dis¬ 
tance OP and whose direction is towards the origin 0 (the inverse square 
law). Verify that there exists a scalar potential function V which is 
inversely proportional to OP and such that F = grad V. 

Solution Let OP = r 
then V — k/r 

where k is a constant and r = VC* 2 + y 2 + z 2 ). 

Then grad V = fcV(l/r) 

= k(— l/r 2 )V(r), 

using the chain rule of differentiation 



and two similar results for differentiation with respect y and z. 

Now r 2 = x 2 + y 2 + z 2 

and so Irdr/dx = 2x, 

i.e. drjdx = xjr, with similar results Srjdy — y/r, and dr/dz = z/r. 
Finally, 

F = grad V= *(-l/r 2 ) 

= ~(klr 3 )r. 

This last result shows that F is proportional to (1/r 2 ) and that F is in 
the direction of — r, i.e. towards O. 

9.12.3 The directional derivative 

Suppose a is a given vector, then the directional derivative of V in the 
direction of a is dVjds where s is measured in the direction of a. 
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(IV dV ¥ . dVdn 

- 7 - = Lim -r- = Lim -j- 
ds os on os 

is measured in the direction of the normal to the level curve 
(see Fig. 9.35). 



_ dV n.a 

“ d " l a l 

(dVjdn)a. a 
“ 1*1 

the unit normal and 8 is the angle between n and a. Finally, 
dV _ VV. a _ grad V. a 
ds ~ aj “ |aj 


(99) 
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Example Find the directional derivative of V = 3 xyz — x 2 y + 2 xz 2 

at the point (1, 1, 1) in the direction specified by the join of (2, 1, 0) to 

(-1,3,4). 

Solution The required direction is a and 

a = —3i + 2j + 4k. 

The directional derivative in the direction of a is 

[(3 yz — 2 xy + 2z 2 )i + (3 xz — x 2 )j + (3xj + 4xz)k ]. [—3i + 2j + 4k] 

V(9 + 4 + 16) 

which at the point (1, 1, 1) becomes 23l\/29. 

EXERCISE 9 

1 State which of the following are scalar quantities and which are 
vector quantities: speed, weight, pressure, time, voltage, tension, 
momentum, temperature. 

2 Two aircraft start from the same point on the Equator. The first 
travels 500 kilometre due East and then 500 kilometre due 
North while the second travels 500 kilometre due North and 
then 500 kilometre due East. Do they arrive at the same point? Are 
these directed magnitudes vectors? 

3 Free vectors a and b are inclined at an angle 0, show that 

|a| - |b| < |a + b| = Via 2 + b 2 + 2 ab cos 0) < |a| + |b|. 

4 The position vectors of the four points A, B, C, and D are respec¬ 
tively, a, b, 3a + 2b, and 2a — b. Express AC, DB, CB, and CA in 
terms of a and b. 

5 In any triangle ABC prove by vector methods that (i) the straight 
line joining the middle points of AB, AC is parallel to BC and half 
the length of BC, and (ii) if D, E, F are the mid-points of BC, CA, 
and AB respectively and O is any other point, 

OD + OE + OF = OA + OB + OC. 

6 Two forces act at the comer A of a quadrilateral ABCD and two 
at the corner C. The forces are represented in magnitude and 
direction by AB, AD, CB, and CD respectively. Show that their 
resultant is represented by 4PQ where P and Q are the mid-points 
of AC and BD respectively. 

7 If a and b are the position vectors of two points A and B find the 
position vector of a point C on AB produced such that AC = 3 AB. 

8 Vectors from O to A, B, C, and D are represented by a, b, c, and d 
respectively. The joins CB and AD intersect at P. If AP = \A D 
and CP = \CB show that d — b = 2(c — a). 
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9 A, B, C, and G are the points (2, 1, 3), (3, -1, -2), (~2, 3, 5), and 
(1, 1^2) respectively and P is any point. Show that PA + PB + PC 
— 3PG and state what can be deduced about the point G. 

10 Find the magnitude and direction of the resultant of the vectors 
® 7 * + 2j — 5k, 3i — j — 2k, — 2i — 2j + k; (ii) of magnitudes 
6, 5, and 3 in the directions [2:1:2], [4:3:0], and [2:2:-1] respec¬ 
tively. 


11 Find the centre of gravity of masses 3, 4, 5 at the points (2, -3, 3), 
(5. —3, —4), and (2, —3, —1) respectively. 

12 Forces of magnitudes 1, 2, and 3 act at one corner of a cube along 
the diagonals of the faces meeting at that corner, find the magnitude 
of their resultant and its inclination to the edges of the cube. 

13 The position vector of a point A is 2i — 3j + k. Find the vector 
equation of the line through A in the direction of the vector 7j + k. 
Find the coordinates of the point B on the line such that AB = 6y/2. 


14 Find the equation of the line passing through the points (1, I, 1) 
and (—2, —1, 3). 


15 A and B are the points whose position vectors are 3i + 4j — k and 
—3i -f 7j + 2k respectively. Find the position vector of the point 
that divides the join of AB (i) internally in the ratio 3:4, and (ii) 
externally in the ratio 4:7. 


16 A force of 13 units acts at the point (I, 3, -4) and its fine of action 
is parallel to the join of the point (1,0, 3) to the point (5, 1, 7). Find 
the vector form of the force. 


17 Find the scalar projection of the vector 21 - 3i + 7k on to the 
vector 61 + 4j — k. 

18 Find the vector projection of 21 + 3j on to the vector 4j + 5k. 

19 Show that the perpendicular distance from the origin to the line 
r = a + tb is given by |a - (a.b)b/P|. Find the perpendicular 
distance from the origin to the line 3j + 4k + /(5i + 5j). 

20 Show that the perpendicular distance from the point whose position 
vector is c to the line r = a + tb is |(c - a) x b\/b. Find the 
perpendicular distance from the point P(- 1, 3, 4) to the join of 
A( 2, 3, 5) and B(-6, 7, 9). 


21 A particle moves in a straight line from a point A( 0, 0, 0) to a point 
B( 0, 1, 2) and then in a second straight line to a point C(l, 0, 3) and 
finally in a third straight line back to A. It is acted on by a force 
21 + 4j + 3k while moving along AB, by a force 2i + 3k while 
moving along BC and by a force —3k while moving along CA. 
Find the total work done by these forces. 
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22 Prove that the vectors a = 2i + jj + 2k, b = i + 3j — 7k, 
c = 3i + 4j — 5k form a right angled triangle. 

23 Find the area of the parallelogram whose sides are the vectors 
i + 3j — k, 2i — j + 2k and determine a unit vector normal to 
its plane. Find also the equation of the plane. 

24 What is the unit vector perpendicular to each of the vectors 
2i — j + k and 31 + 4j — k? Find two unit vectors in the plane 
of these vectors, one perpendicular to each of the given vectors. 

25 Find the moment about a point i + 2j — 2k of a force represented 
by 3i + 4k acting through the point whose position vector is 
i - j + 3k. 

26 The vector c = li + mj + nk is perpendicular to the vectors 
a = i + 2k and b = — i + 2j + 3k. Show that l-.m-.n = 4:5: —2, 
(i) by solving the equation c . a = 0 = c. b, and (ii) by evaluating 
a X b. 

27 Forces i — j + 3k, — 3i + 3j — 2k, and 2i — 2j — k act at the 
points (1,1,1), (2, 3,0), and (—3,4,1) respectively. Show that they 
are equivalent to a couple of moment V138 about an axis whose 
direction ratios are [—4:1:11]. 

28 If a X b = n and n . c = « where a = aj + a 3 k, b = bj + b 2 j, 
and c = c 2 j + c 3 k show that a = a 3 b 2 c 3 + a a byC 2 . 

29 Show that the line through the point (0, 4, 4) in the direction of the 
vector [—1,2, 1] intersects the line through the point (1, 4, —1) in 
the direction of the vector [0,1,-2] and find the point of inter¬ 
section. 

30 Find the shortest distance between the lines 

(x + 3)/2 = (y — 6)/3 = (z — 3)/(—2) 
and x/2 = (y — 6)/2 = z/(—1). 

31 Find the triple scalar product of the three vectors a = 2i — j + k, 
b = i — k, and c = 3i 4- 2j. 

32 Find values of /S if 2i + /Sj + k, /Si + j + k, and /Si + 3j — 3k, 
are coplanar vectors. 

33 Find the coordinates of the point of intersection of the line 
r = a + tb and the plane r . n = 8 when a = 8i + 7j + 12k, 
b = 3i + 4j + 6k, and n = 4i + 3k. 

34 Show that the system of equations 

3x — 2y + z = 1 
2x + 4y — 2z — 3 
lx + 6y — 3z = 7 

is consistent but indeterminate and illustrate the result geometrically. 
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35 Show that the system of equations 

2x + 3y - 7z = 2 
x~ y =4 
3x + 2y - Iz = 1 

is inconsistent and illustrate geometrically. 

36 Prove that (a x b). (a x c) = (b. c )a 2 - (a . b)(a . c). 

37 ABC is a triangle with a right angle at C. If CB = a, CA = b, and 
AB = c use the scalar product of vectors to prove Pythagoras’ 
theorem. 

38 P u P 2 , and P 3 are the points whose position vectors are r 1; r 2 , and 
r 3 respectively. Show that the equation of the plane through the 
points is given by 

(r - r x ). (r a - r a ) x (r 3 - r,) = 0 
and express the result in the form of a determinant. Find the 
equation of the plane through the points (2, 1, 4), (2, 0, 1), and 
(-1,3,0). 

39 Given r = cos ti - sin 2tj + f 2 k find dr/dt, dh/dt 2 and \d 2 r/dt 2 \. 

40 The position vector of a point at time t is 

r = 2ti + i t 3 j - 2k. 

Find the velocity and acceleration and show that 
4v 2 = 16 +/ 4 

where v and/are the magnitudes of the velocity and acceleration. 

41 If r = a sin cot + b cos cot where a and b are constant vectors, show 
that d 2 rldt 2 = — co 2 r. 

42 Show that r = 2i cos t + 2j sin t lies on a circle for all t. Show that 
the equation of the tangent at the point is 

r = 2i cos ti + 2j sin t a + a(2j cos t x — 2i sin fj) 
where a is a parameter. 

43 By differentiating a. a show that a. (dn/dt) = a(da/dt). If a is of 
constant length deduce that da/dt and a are at right angles. 

44 A particle moves on a circle r — a with a variable angular velocity 
co. Find the components of acceleration. 

45 Verify the expressions f and r6 for the velocity components in polar 
coordinates by differentiating and resolving the relationships 
x = r cos 6, y = r sin 0. Also verify the acceleration components 

73 ^ 

46 If a = 2/i - t 2 j + 2k, b = 3i + tj - 2tk, and c = t 2 k find 
d( a . b )/dt, d( a x b )Jdt, and d( a . b x c )jdt, (i) by differentiating 
the products, and (ii) by first evaluating the products. 
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47 A body moves on the curve r = ae flcota where a and a are constants 
(the equiangular spiral) in such a way that dd/dt is constant and 
equal to <a. Find the polar components of the velocity and show 
that the resultant velocity makes a constant angle with the radius 
vector. Find also the polar components of the acceleration. 

48 A particle is moving such that its acceleration at time t is 2i + ?k. 
If it initially passes through the point (1, 2, 3) with a velocity 
2j — k, find the parametric equations of its path. 

49 Obtain the eigenvalues and normalised eigenvectors of the matrices 



50 Given that the matrix 



has an eigenvalue 18 and corresponding eigenvector 



find the other eigenvalues and eigenvectors. 

51 Find the eigenvalues and three mutually orthogonal eigenvectors 
of the symmetric matrix 



52 Find the angle between the normals to the surfaces x 2 + y 2 + z 2 = 14 
and z = x 2 y 2 — 2 at the point (I, — 2, 3). 

53 P is a fixed point 2i — j + k and Q is the general point xi + y\ + zk. 
The function <j>(x, y, z ) is the square of the distance PQ. 
Evaluate grad </>, and find the directional derivative of <f> in the 
direction i + 2j + k when Q has position vector 31 + j — 2k. 

54 A particle is constrained to move along the helix 

r(s) = cos cos i + sin cos j + cus k 

where m 2 + a 2 = 1. It is acted upon by a force field F = V<f> where 
<f> = log r and r = |r|. Show that the work done by the force on the 
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particle during 


incremental displacement dr is 
F . dr = f\. ds 


and hence when the particle moves from s = 0 to s = Itt/co the total 
work done is log y/(\ + 47r 2 a 2 /to 2 ). 



10 

Complex Numbers and Complex 
Variables 


10.0 INTRODUCTION 

From a historical point of view the theory of complex numbers arose 
from the study of the solution of algebraic equations particularly of the 
quadratic equation ax 2 + bx + c = 0. The equation x 2 — 4x — 5 — 0 
or (x — 2) 2 = 9 has two unequal roots given by x — 2 = ±3; the 
equation x 2 — 6x + 9 = 0 or (x — 3) 2 = 0 has two equal roots x = 3 
but the equation x 2 — 4x + 13 = 0 or (x — 2) 2 = —9 has roots 
which involve the finding of a number whose square is —9. Such a 
number can be represented by the introduction of a symbol i (some¬ 
times j is used) such that i 2 = — 1 or i = \f — 1 and the two roots of 
x 2 — 4x + 13 = 0 can then be formally written x — 2 = ±13. The 
invented number i 3 was called an imaginary number, the other numbers 
such as 3, —3, or \/3 with which familiarity was already established 
being called real numbers. The numbers of the form 2 ± i'3 were called 
complex numbers. The main reason for the introduction of such numbers 
was for uniformity so that it could be said that any quadratic equation 
always had two roots, real and unequal, real and equal, or complex. 

The term imaginary is an unfortunate legacy from the past and 
implies that the numbers have some mystical character. From a mathe¬ 
matical point of view the imaginary numbers or the more general 
complex numbers are considered well-behaved providing they obey a 
consistent set of algebraic rules, while from an engineering viewpoint 
the new numbers are of immediate interest and acceptable providing 
that they can be successfully used in an engineering context. For the 
latter it is more advantageous to ignore the historical aspect of complex 
numbers and to consider the symbol i as an operator. Note that i here 
has no connexion with the unit vector i (of i, j, k) of Chapter 9. 

10.1 SYMBOL i AS AN OPERATOR 

Consider any vector OP lying in a plane and of length a units, i.e. 
a = OP. The symbol (— 1) may be thought of as an operation which 
when applied to OP has the effect of turning OP through two right 
angles in the anticlockwise direction, this being the mathematically 
positive direction, and leads to another vector OQ such that 
OQ = (—l)OP = (—l)a = -a. 


( 1 ) 
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A ‘mid-way’ stage in this operation would be to turn the vector OP 
through one right angle in the anticlockwise direction. Suppose tha t 
the symbol / is given to this operation then iOP will be represented, in 
Fig. 10.1, by the vector UR such that 

OR — iOP = ». (2) 



A further application of the operator i to the vector OR will then 
turn it through a right angle to the position of OQ so that 

-a = OQ = i(iOP) = i 2 OP = i 2 a. (3) 

Equation (3) shows the effect of a double application of the operator i. 
That is ii or i 2 turns the vector through two right angles and has the 
same effect as the operation — 1 of equation (1). In an algebraic sense 
(3) may be interpreted as showing the equivalence of i 2 and — 1 and 
in this sense i could be said to be ^/—l. As mentioned in section 10.0, 
the statement i = \/— 1 maybe used as a starting point for the historical 
development of a complex number theory. The operational approach 
is simply to consider i as an operation that turns a vector in a plane 
through one right angle and i 2 (or — 1) as the operator that turns the 
same vector through two right angles. 

A third application of the operator i will turn OQ into OS, i.e. 

OS = i(i 2 OP) = POP = -iOP (4) 

and i 3 = ip = ( 5 ) 

The operator i 3 applied to OP has the same effect as the operator (— i) 
applied to OP. The first turns OP through three right angles in an anti¬ 
clockwise direction, the second turns OP through one right angle in a 
clockwise direction and in either case the vector OS results. 
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10.1.1 The Argand diagram 

In Fig. 10.2 let X'OX produced be an axis on which all real numbers 
from —co to +oo can be represented, O being the origin zero. For 
example the number +3 is represented by the point P on the line OX 
distant 3 units from O and the number —3 is represented by the point 
Q distant 3 units from O in the direction OX'. These numbers may be 
represented also by the vectors OP and OQ (or — OP) but by convention 
they are simply written +3 and —3 the direction of the vector being 
understood. Now draw an axis Y'OY to intersect X’OX at right angles 
at O then a number such as i2 may be represented by the point R distant 



2 units from 0 in the direction OY and — /2 by the point S distant 
2 units from O in the direction OY'. Again, the number i2 may be 
represented by the vector OR being the vector +2 turned through 
one right angle in the anticlockwise direction, i.e. OR = /(+2) = il. 

Consider the addition of the two numbers 3 and il. The sum is 
formally written 3 + /2 and may be represented by the point T or by 
the vector OT, the sum of the vectors OP and OR (see section 9.3.3). 
Thus in Fig. 10.2 the complex number 3 + il is represented either by 
the point T or by the vector OT. 

The axis X'OX containing all the real numbers from — co to + oo 
is usually called the axis of real numbers or the real axis and the axis 
Y'OY containing all the imaginary numbers from —/oo to +/oo is 
called the axis of imaginary numbers or the imaginary axis. It is usual 
to let any arbitrary real number be denoted by x and to let any arbitrary 
imaginary number be denoted by iy (where y is a real number). The 
sum of these numbers is then denoted by z, i.e. 
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The two numbers x and y are called the real part and the imaginary 
part of z respectively and written Re z and Imz. 

The geometrical interpretation that any complex number z may be 
represented in the Cartesian plane by a point whose coordinates are 
(x,y) where the x-axis contains all the real numbers and the j-axis 
contains all the imaginary numbers is called the Argand diagram. The 
Cartesian plane is often called the z-plane. 

Note 1 There is rarely any confusion between z the complex number 
and z the spatial coordinate of (x, y, z) because the algebra of 
complex numbers is applied to problems essentially of a two-dimen¬ 
sional nature. 

Note 2 Although y is called the imaginary part of z it is a number as 
real as x. Some writers call iy the imaginary part of z. 

Note 3 Too much emphasis should not be placed on the representation 
of a complex number as a vector because of the operator i in the 
representation. For instance if ej and e a are two orthogonal unit base 
vectors in a Cartesian coordinate system then by equation (33) of 
section 9.6.1, ei. = 1 and e 2 . e 2 = 1 but if n is a unit vector 

along OX in the Argand diagram so that in is the unit vector along 
OY then n. n = 1 whereas (in). (in) = — 1, i.e. a different product 
is obtained depending on the direction taken. 

Note 4 The real numbers are a particular case of the complex numbers. 

Note 5 Numerical complex numbers are frequently written in the form 
3 + 2i instead of 3 + i2. 

10.2 THE ALGEBRA OF COMPLEX NUMBERS 

The rules of common algebra may be applied to complex numbers 

providing that any product i 2 is replaced by —1. 

10.2.1 Equality of complex numbers 
Two complex numbers are equal 

*i + tyi = x 2 + iy 2 

if and only if Xi = x 2 and y x = y 2 , as is immediately obvious by 
plotting the two points on the Argand diagram. The zero complex 
number is 0 + i'O represented by the point O in the Argand diagram. 
A complex number is zero if and only if its real and its imaginary parts 
are zero. 

10.2.2 Addition and subtraction of complex numbers 

The sum of the complex numbers z 1 and z 2 is given by 

(*i + i>i) + (x 2 + iy 2 ) — (Xj + x 2 ) + i’Oj + y 2 ) 
and is identical with vector addition. 
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The difference of two complex numbers z x and z 2 is the sum of z x 
and —z 2 , i.e. 

Zi — z 2 = z x + ( z 2 ) 


= (*i + i>i) - (x 2 + iy 2 ) 
= (*i - x 2 ) + i(y x - y 2 ). 



In Fig. 10.3 the points P and Q represent z x and z 2 respectively, R is 
the sum z x + z 2 and S is the difference z x — z 2 . It is sometimes an 
advantage to represent z x — z 2 in vector form by OS or its equivalent 
vector QP. (Compare with section 9.3.3.) 

10.2.3 Multiplication of complex numbers 

Given two complex numbers z x and z a the product is another complex 
number z 3 such that 

Z 3 = ZjZ 2 

= (^i + *>i )(^2 + iy 2 ) 

- (^ 1^2 - JiTa) + i(*i 72 + x 2 yi). (6) 

The result (6) shows that multiplication is commutative, i.e. 


The product of two complex numbers is real providing that 
*i : Ji = -x 2 :y 2 . 

In particular the numbers x x + iy x and Xj. — iy x whose product is 
x \ + y\ af e called conjugate complex numbers. If z = x + iy is any 
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general complex number the conjugate is denoted by z, so that 
z = x - iy. In the Argand diagram z is the reflection of z in the real 
axis. Note that 

Zi ± z 2 = z 1 ±z 2 
ZiZ 2 = ZjZ 2 . 

Example (i) (3 + 2/) + (5 - 7/) = 8-5 i 
(3 + 2i) - (5 - 7i) = -2 + 9 i 
6(1 - 50 = 6 - 30/ 

3(2 - 3/) - (7 - 2/) + (1 + 7/) = (6 - 9/) - (6 - 9/) 

= 0 + 1 ' 0 . 

Example (ii) (4 + 2/)(3 - /) = (12 + 2) + /(6 - 4) 

= 14 + 2/. 

Example (iii) /(I + 21) = / + 2/ 2 

= -2 + /. 

Note that if (1 + 2/) in example (iii) is represented by the vector OP 
then /(I + 2 i) is represented by the vector OQ obtained by turning OP 
through one right angle in the positive sense. 

10.2.4 Division of complex numbers 

The value of (x x + iy)l{x 2 + iy) is defined as a third value X + iY 
such that 

*i + iy, = (x 2 + iy)(X + iY) 

- (x 2 X -y 2 Y) + i(x 2 Y + Xy 2 ). 

From the equality of complex numbers it follows that 
x 2 X — y 2 Y = 

and x 2 Y+Xy 2 =y 1 

and hence X+iY =^j + ~ x ^. 

x l + y\ x \ + y\ 

This result can be obtained by an alternative but equivalent process as 
follows. The numerator and denominator of the quotient 

(*i + *>i)/(*2 + iy) 

are each multiplied by the complex conjugate of the denominator so 
as to make an equivalent quotient with a real denominator. 



250 


MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


Thus 


fa + iyi)(x 2 - fys) 

fa + *>a)fa - '> 2 ) 


= fas 2 + J 1 J 2 ) + i(x 2 y 1 - xxy 2 ) ^ 

The result (7) need not be remembered but each division required 
should be evaluated using this alternative process. 


Note that (7) reduces to xjxz ifji = 0 = y a . 

(3 + 2/) (3 + 20(4 + 30 

Sxampk ( 1 ) (4”—*3i) = (4 - 30(4 + 3i) 

(12 - 6) + /(9 + 8) 
(16 + 9) 

= (6 + 170/25. 

(1 +0(2-0 3 + i 

Example ( 11 ) (1 _ f)(3 + = 4 ^2i 

(3 + Q(4 + 2Q 
16 + 4 
= (1 + 0/2. 


10.3 POLAR FORM OF A COMPLEX NUMBER 
Any complex number z = x + iy may be represented by the point P 
on the Argand diagram as in Fig. 10.4, i.e. by the point whose Cartesian 
coordinates are fa _y). 



It is frequently an advantage to represent the point in terms of its polar 
coordinates (r, 0). Now, 

z = x + iy = r cos B + ir sin B 
= r (cos 6 + i sin B). 

In abbreviated form this is often written r cis B or r[6, so that 
r(cos 0 + / sin 0) = r cis 6 = r[B. 


( 8 ) 
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10.3.1 Modulus and argument 

The polar coordinate r is a positive quantity being the magnitude of 
the vector OP. It is called the modulus of z and is written mod z or 
1 * 1 . Le - 

mod z = \z\ = r = V(* 2 + y 2 )- (9) 

The polar coordinate 0, giving the direction of the vector OP, is 
called the argument or amplitude of z and written argz, ampz, or 
am z. The value of 0 is found from 

tan 0 = y/x (10) 

the quadrant in which 0 lies being determined from the equations 
x = r cos 0, y = r sin 0, —180° < 0 < 180°, 

It is sometimes convenient simply to plot the complex point on the 
Argand diagram and hence to determine the quadrant in which 0 must 
lie. The value of 0 within the above range is often called the principal 
value of arg z. Note that — 1 = 1 /180° and that if z = r/0 then 
2 = rJ—0. ~ 

Example (i) Express 2 + 3/ in the polar form. 

Solution r = \/(4 + 9) = VI3 

and tan 0 = 3/2 

0 = 56° 19'. 

In abbreviated form 2 + 3/ s yi3 /56° 19 ' s= VI3 cis (56° 19'). 
Example (ii) Express —2 — 3/ in polar form. 

Solution Here, r = V13 
and tan 0 = —3/—2 

0 = -(180° - 56° 19') = -123° 41'. 

10.3.2 Multiplication and division in polar form 
Let 

z x — r x { cos 0! + / sin 0 X ) 
and z 2 = r 2 (cos 0 2 + / sin 0 2 ) 

then the product z 2 z 2 is given by 

ZiZ 2 = r x r 2 (cos 0 X + i sin 0i)(cos 0 2 + / sin 0 2 ) 

= JVattcos 0 2 cos 0 2 — sin 0 X sin 0 2 ) 

+ /(sin 0! cos 0 2 + cos 0 X sin 0 2 )] 

— r x r 2 [cos (0! + 0 2 ) + i sin (6 1 + 0 2 )] 


(ID 
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or in abbreviated form 


O'l/fyfo/fla) = r 1 r 2 l(d 1 + 0 2 ) = r ± r 2 cis (0! + 6 2 ) (12) 


Thus to multiply two complex numbers in polar form multiply the 
moduli and add the arguments. Similarly, 
z 1 _ ^(cos 0 ± + i sin OQ 
z 2 — r 2 (cos 6 2 + i sin 0 2 ) 

_ r ± (cos 0x + i sin 0j)(cos 0 2 — i sin 0 2 ) 

~ r 2 cos 2 0 2 + sin 2 0 2 

r 1 cos (d 1 — 0 2 ) + i sin (d 1 — 0 2 ) 

= 7 2 l 

or, in abbreviated form. 


r 2 ^2 ~ r 2 


ie, ~ 62 . 


(13) 


Thus, to divide two complex numbers in polar form, divide the moduli 
and subtract the arguments. 

The argument of a product, or quotient, of two complex numbers 
should always be given within the range of values —180° to +180°. 
Note the following results 

kiZal = NN; |zi/z 2 | = N/N; zz = |z| 2 = |z| 2 = r 2 . 
Example (i) In polar form (—1 + i)(l + i\/3) may be written 
(V2 /135°) (2/60°) or 2V2/195 0 . Using the principal value for the 
argument this is 2\/2 /—165° . 


Example (ii) Express (—i)/(2 + 2/)(—3 + \Zii) in polar form by two 
different methods. 


Solution 

(a) (2 + 2i)(—3 + V30 = (-6 - 2V3) - i (6 - 2V3) 


V6KV6 + a/2) + i(V6 - V2)\ 
(V6 - -y/2) + f(V6 + a/2) 

16^6 

_ -y/16 /tan- 1 (-y/6 + V2)/(V6 - -y/2) 


16V6 
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(b) 


1 /-W2 


(2 + 2/)(—3 + V30 (2V2/7r/4)(2V3/57r/6) 

^ MW 2 4- tt/4 + 5tt/6) 

- 4^6 

4^8 ££• 


10.3.3 Representation of multiplication and division on the Argand 
diagram 

Given a complex number z, represented by the point P or the vector 
OP in the Argand diagram, an examination of equation (12) shows that 
the effect of multiplying by z 2 is to rotate the vector OP through an 
angle arg z 2 in the positive (anticlockwise) direction and to change the 
magnitude of OP in the ratio nr 2 :r u i.e. r 2 :1 or |z 2 |: 1. Similarly from 
(13) the effect of dividing Zj by z 2 is to rotate the vector OP through 
an angle arg z 2 _i n the negative (clockwise) direction and to change the 
magnitude of OP in the ratio 1: |z 2 |. 



In Fig. 10.5, P represents z lt Q represents z 2 , and I represents the 
point 1 + /0. Then R will represent the point z 2 z 2 and S’ will represent 
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the point zjz a where R and S are such that the triangles OQI, ORP, 
and OPS are similar so that 

OR _ OQ 
OP ~ 1 

os_ j_ 

OP ~ OQ 

and /ROP = jPQS = /QOI = arg z 2 . 

10.3.4 Inequalities 

Complex numbers cannot be ordered in the same way as real numbers 
so that a statement such as z x > z 2 or Xj. + iy x > x 2 + iy 2 is meaning¬ 
less. However, numbers can be partially ordered by the size of their 
moduli, e.g. |3 + 2i| > |1 — ij. From Fig. 10.3, it is evident that 

ki + *a| < |*i| + |*a| 

l*i “ z sl < l*il + N 
l*i *a| > lki| - Nl. 

These results should be compared with those of example (ii) of section 
9.3.3. 

10.4 COMPLEX EXPONENTIAL QUANTITY 
In Chapters 4 and 5 it was found that for all real x 

exp x = e* = f x n /n\ = 1 + x + * 2 /2! + . . . (14) 

sin x = f (—l)"x 2ft+1 /(2n + 1)! = x - x 3 /3! + x 5 /5! + . . . 
»=o (15) 

and cos x = f (-l)"x 2n /(2«)! = 1 - x 2 /2! + x 4 /4! + . . . (16) 

For complex z, a quantity exp z may be defined as in equation (14), i.e. 

exp z = 1 + z + z 2 /2! + z 3 /3! + . . . = f z"/nl. (17) 

It is not immediately evident that exp z behaves like an exponential 
quantity and a certain amount of discussion of the properties of (17) 
is required. 

The series 2 \ z \ n l n - or 2 r "/«! is convergent for all r, i.e. for all |z|, 

and hence the series J z n /n! is absolutely convergent (see Chapter 3). 

Note that if z is real, that is if z = x = r, then the series converges to 
e*, i.e. 


or OR = |z 1 ||z 2 |, 

or OS = N/|z 2 |, 


exp z = exp x = e*. 
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If z is replaced by iy then (17) gives 
exp (i» = 1 + (iy) + (i» 2 /2! + (i» 3 /3!. . . 

= (1 - fl 2! + J74! -•..) + 'O ~ J 3 /3! +>' 5 /5! - . . .) 
= cosj-f isinj (18) 

using (15) and (16). 

To establish the result 

exp (z ± + z 2 ) = exp (z a ) exp (z 2 ) (19) 


requires a theorem concerning the multiplication of absolutely con¬ 
vergent series the proof of which cannot be given here. The theorem 

states that if S x = 2 a n and S a = 2 b n are two absolutely convergent 
series, then 

where c n = 2 a tK- 1 - 

To form the result (19), let S x = exp z x and S 2 = exp z 2 so that 
a n = z n Jn\ 

and b n = z"/«!, 

then c n = 2 o l4ltl][zr t K'i-t)l} 

= (!/«!) i 

= (Zi + z 2 )7«! 

the last line following from the binomial theorem. Hence by the quoted 
theorem 

(| z x/ n ') (| z ll n ') = 1 ( z i + ZaY/n ! 

and the result (19) follows at once. 

Using (19) 

exp z = exp (x + iy) 

= exp x exp (iy) 

— e x (cos y + i sin y) (20) 

and expz is a complex number of modulus e 1 and argument y. 
The complex number exp z of (17), the sum of the series 2 z n /«!, has 
the properties of an exponential quantity and will be designated e 2 . 
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Thus (19) may be written 

e 2 i + z s — e Zi e Za 

and (20) becomes 

e 2 = exp z = e 1 (cos y + i sin y) 

= e* exp (iy) 

= e I e to (22) 

using (18) in the form 

e <y = cos y + /' sin y. (23) 

The equation (19) may be extended to read 

eSl+a2+s! 3 + ...+*„ _ e sx e 2 a e *3... e ®n. (24) 

The polar form of the complex number may now be written 

z = x + iy — re w = r(cos 0 + i sin 6). (25) 

The exponential form of the complex number is called Euler's form and 
6 is now expressed in radian measure. 

Example i = e l7r/2 ; — 1 = e* 77 ; 1 = e i0 = e 2l7r = e 2ki7r , Jc an integer; 
y/3 — i = 2e~ lnie ; -l + i = y/2e 3inli ; 

2 + i — V5e w , 6 = tan' 1 ® = 0.46. 

Unless stated otherwise the argument 0 is always expressed in 
principal value form, i.e. —n < 0 < +w. It should be noted, however, 
that given an argument 0, then 

e <0 _ e (ie+ 2 tkn) f p- an integer. (26) 

10.5 DE MOIVRE’S THEOREM 

This theorem states that for all n 

(cos 0 + i sin 0) n = cos nd + i sin nd. (27) 

There are three cases to consider: 

(i) n a positive integer, 

(ii) n a negative integer, and 

(iii) n rational but not an integer. 

Case (i) 

From (25) of section 10.4, 

cos 0 + i sin 0 = e iS 
and so (cos 0 + i sin 0) n = (e w ) n 

= (e <9 )(e i8 )(e w ) . . . (e‘ a ) 

= e‘ n0 

using (24) of section 10.4. Hence, 

(cos 0 + i sin 0) n = cos nd + i sin nd. 


( 28 ) 
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Case (ii) 

Let n = —m where m is a positive integer and write 
(cos 0 + i sin 0)" = (e i8 ) n 

= (e‘T m 

= l/(e‘ a ) m 
= l/e <mfl 

using equation (24) of section 10.4. 

From equation (21) of section 10.4 with z 2 = - z x 
1 = e° = e Zl e -s i 

or e-^ = l/(e Zi ). (29) 

Hence, (cos 0 + i sin 0) B = l/(e imfl ) 


= cos nO + i sin n0. (30) 

Case (iii) 

Let n = plq where p and q are integers without a common factor (other 
than unity), then 

(cos 0 + i sin 0)" = (e w ) n 


= (e <pfl ) l, «, since p is an integer 

= s ; nce q j s an j n t e g ei . 

= ( e ip8/ «) 

= e ,n * 

= (cos nd + i sin w0). (31) 

Thus, for all n 

(e‘T = e ln * (32) 

which is Euler’s form of de Moivres theorem. Note that if n is a rational 
fraction plq then e tpei “ is only one of the possible values of (e w ) p, ‘ 1 . In 
general terms, from (26) of section 10.4 

e ipe,q = (e w+2ik ^) p "> ( 33 ) 

Example (i) (1 + if’ = (^2 e^' 4 ) 27 

_ 234/2 g27far/4 
_ 234/2 e 6/jr + 3Jjr/4 

= 2 27/a (cos 37r/4 + / sin 3 tt/4) 

= 2 13 (—1 + i ). 
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Example (ii) Find cos 40 and sin 40 in terms of powers of cos 0 and 
sin 0 respectively. 

Solution Since (cos 0 + i sin 0 ) 4 = cos 40 + i sin 40 then 
cos 40 + i sin 40 = cos 4 0 + 4/ cos 3 0 sin 0 

— 6 cos 2 0 sin 2 0 — 4/ cos 0 sin 3 0 + sin 4 0 

after using the binomial expansion. 

Equating real and imaginary parts (for equality of complex numbers), 
cos 40 = cos 4 0 — 6 cos 2 0 sin 2 0 + sin 4 0 
= 1 — 8 cos 2 0 + 8 cos 4 0 
and sin 40 = 4 cos 3 sin 0 — 4 cos 0 sin 3 0 

= 4 sin 0 cos 0 (cos 2 0 — sin 2 0). 

10.5.1 The nth roots of unity 

From equation (26) of section 10.4, with 0 = 0, 1 = e 2tkn where k is 
an integer k = 0, 1, 2, 3, . . . . Suppose the nth roots are denoted by 
z 0 , z lt z a , . . . then 

z k = 1/1 = ( e 2ifc7r ) 1,n 

_ Q^iknin from equation (32), 

= cos ( 2 & 7 r/n) + i sin (Ikn/n) (34) 

The expression (34) has n distinct values with argument in the range 
(— 7 r, +7r), for any n successive values of the integer k. For k = 0, 1, 2, 
. . ., (n — 1 ) the successive roots are 

z 0 = e° = 1 ; = e 2i,r/n ; z a = . . .; 

z — e 2(n-l)Or/n _ g-aijr/n. 



Fig. 10.6 
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The other values of k give values of z k which simply repeat those already 
found, for example k = n and k = n + 1 give z„ = e 2in,r/n = e 2l7T 
= 1 = z 0 and z n+1 = e 2<n+1)i ” ln = e^e*” = e 2Mn = Zl . Thus it 
will be seen that there are n values for the «th roots of unity, no more 
and no less. Put another way, it follows that the equation z" = 1 has 
n distinct roots when n is a positive integer. If n is even, two of these 
roots are the real roots ±1 and the rest are all complex; if n is odd 
then one of these roots is real, i.e. +1, and the rest are complex. 

The roots given by (34) may be conveniently illustrated by plotting 
on the Argand diagram. The first root is the point 1 + iO and the others 
are distributed on the circumference of the unit circle, (i.e. the circle 
of radius 1, centre z = 0) at equal angular intervals of 2-njn as shown 
in Fig. 10.6. 

Example (i) Find the fourth roots of unity. 

Solution Here, 

z k = yi = le 2 " C7r/4 = le'^' 2 
and the roots may be represented in abbreviated form by 
1/0; I /tt/2 ; 1/tt; 1 /3tt/2 

or 1 + iO; 0 + i; -1 + iO; and 0 - i. 

Example (ii) Find the fifth roots of unity. 

Solution The roots are given by 

z k = 1 e 2,fc »/s, k = 0, 1, 2, 3,4 

and may be written 

1/0=1; 1/277/5 = 1/72°; 1 /144° ; 1/216°; 1 /288° . 

These roots are plotted on the Argand diagram in Fig. 10.7. 



Fig. 10.7 
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10.5.2 The »th roots of a complex number 

If a complex number is expressed in the Euler form then the roots may 
be found by the method of section 10.5.1. Suppose a = re ifl or 
re i(o+2k77) w h ere r = | fl | an( j 0 = arg a and k is an integer, then 
$a = a lln 

— (r 1 i n )e t <- B+2kir '>i n . 

The roots may then be represented in the form U/r) l(6 + 2kv)jn . In 
the Argand diagram the roots all lie on the circle of radius r, the first 
root (for k = 0) has the argument (Ojn) and the other (n — 1) roots 
are distributed symmetrically around this circle at angular intervals of 
( 2 ir/fi). 

Example Find the cube roots of — 5 + 2i. 

Solution Write 

—5 + 2* = (29)* cis 0 
where tan 0 = — I 

and 6 = 180° - 21° 48' 

= 158° 12'. 

The three cube roots are then 

z* = 29 J cis (158° 12' + A:360°)/3, k = 0, 1, 2 
= 29“ cis (52° 44' + kl20°) 

= 1-753 /52° 44' + fc!20° 
i.e. 1-75 /52° 44' ; 1-75 /172° 44' ; 1-75 /292° 44' . 



In this form the roots may be conveniently plotted on the Argand 
diagram and in Fig. 10.8 they are all seen to lie on a circle of radius 
1-75. The roots may also be written 

1-75(0-6055 + *0-7959); l-75(-0-9919 + *0-1265) 
1-75(0-3864 - *0-9223). 


and 
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10.5.3 The roots of an algebraic equation 

The roots found in section 10.5.2 are the roots of the equation z n = a. 
It is seen that there are n roots. It is a fundamental result in complex 
algebra that any equation of the form 

/(z) es z» + b iZ*- 1 + Z> 2 z“- 2 + . . . + b n = 0 (35) 

where n is a positive integer, has exactly n roots, any multiple root 
being counted the requisite number of times. The equation z n = a is 
simply a particular case of (35). 

Suppose that z = a + if} is a root of (35) then 

/(z) =/(« + i/5) = (a + ift n + ft(a + ift"' 1 + . . . + b n = 0 

and, providing that the b r are real, evaluation of the binomial expres¬ 
sions leads to 

<£(oc, ft + !>(«, ft = 0 (36) 

where <f> and %p are known functions. 

Equation (36) is satisfied by 

#«,/?) = 0 = y(a,ft 
and hence it follows that 

#«, ft “ *>(“> ft = 0. (37) 

Equation (37) shows that a — i/3 is also a root of 

fiz) = 0 

and that complex roots of an algebraic equation with real coefficients 
occur in conjugate pairs. This result can be usefully employed when 
solving such equations. 

Example (i) The roots of the quadratic equation ax 2 + bx + c = 0 
with 4 ac — b 2 < 0 are given by 

x = a + i/S and x = a — i/3 
where a — —bjla and /S = \/(4«c — b 2 )/2a. 

Example (ii) Solve the equation 

z 4 _ 4z 2 + 8z + 35 = 0 
given that 2 + i\/3 is one root. 

Solution Two of the roots are 2 + i\/3 and 2 — iy /3 and hence 
(z — 2 + zV3)(z — 2 — i-v/3) is a factor of z 4 — 4z 2 + 8z + 35, i,e, 
z 2 — 4z + 7 is a factor. 
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By inspection, 

(z 2 -4z + 7)(z 2 + 4z + 5) = 0 
and the other two roots are given by 

z = -2 ±V(4 - 5) = -2 ± i. 

It is important to note that the proviso that the coefficients b, are 
real is necessary for the derivation of (37). When the b, are complex the 
roots of (35), while still n in number, do not occur in complex conjugate 
pairs. For example, the roots of 

z 2 + 2iz + 3 = 0 
are given by z = i and z = —3 i. 

10.6 TRIGONOMETRIC AND HYPERBOLIC FUNCTIONS 

From equation (23) of section 10.4 

e w = cos 0 + i sin 6 
and e “ ie = cos 0 — / sin 0. 

Adding and subtracting leads to 

cos 0 = e< * + 2 C ~ W (38) 

and sin 0 = -—^— (39) 

Recalling the definition of the hyperbolic functions of section 4.5.1 

Q<t> -L Q-<t 

cosh <f> =- 2 - (40) 

e « _ e - * 

and sinh <f> =- 2 — (41) 

and replacing <f> by id gives two important relationships, 

cosh id — -—— = cos 0 

and sinh id = - -— = 1 s i n ® 

Similarly, replacing 0 in equations (38) and (39) by id gives two other 
relationships 
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To each trigonometric identity in 0 there corresponds an equivalent 
hyperbolic identity obtained by replacing 0 by id and using equations 
(43). 

Example (i) cos (A + B) = cos A cos B — sin A sin B 
and so cos {iA + iB) = cos iA cos iB — sin iA sin iB. 

Using equations (43), 

cosh (A + B) = cosh A cosh B — i 2 sinh A sinh B 
= cosh A cosh B -f sinh A sinh B. 

Example (ii) Since 

sin 20 = 2 sin 0 cos 0 
then, using equations (43), 

i sinh 20 = 2i sinh 0 cosh 0 
or sinh 26 = 2 sinh 0 cosh 0. 

10.7 FUNCTIONS OF A COMPLEX VARIABLE 

The complex quantity z or x + iy has, so far, been considered fixed, 
e.g. as 3 + 2i, and represented on the Argand diagram by a fixed point. 
If, however, x and y are permitted to take on different values then z 
will take up different positions in the z-plane and is then called a 
complex variable. 

For example, if z = x + iy and |z| = 1 then z is represented by any 
point on the circle of unit radius, centre the origin, i.e. on the unit 
circle on which x and y vary according to the rule x 2 + y 2 = 1. 

Instead of z being constrained to lie on some particular curve of the 
z-plane it is often required that z takes a position interior (or exterior) 
to a curve, i.e. a position within some region of the z-plane. For 
example, if z = x + iy and |z| < 1 then the complex variable z is 
represented by any point interior to the unit circle x 2 + y 2 = 1 of the 
Argand diagram. 

Any expression that contains the variable z, such as z 2 , 1/z, e* and 
takes a single value for any given value of z is called a function of the 
complex variable z. This function is usually abbreviated to /(z) or 
w(z), i.e. 

w = f(f) =f(x + iy). (44) 

Example (i) If w = z 2 then 

*> = (* + iy ) 2 
= x 2 — y 2 + 2 ixv. 

Example (ii) If w = 1/z then 
w = 1 /(* + iy) 

= (x- iy)/(x 2 +y 2 ). 
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These two examples show that a function of a complex variable is itself 
a complex variable, i.e. 

w = u + iv =f(x + iy) (45) 

and u = u(x, y), v = v(x, y). 

10.7.1 Mapping function 

If z is represented by a point on the Argand diagram then w will be 
represented by another point of the same Argand diagram. For example 
if z = 1 + i then the result above shows that w = z 2 is represented 
by the point 0 + 2i. However, it is usual to indicate the point z on 
one Argand diagram, i.e. the z-plane, and to indicate the correspond¬ 
ing point w on a second Argand diagram, i.e. the w-plane with axes of 
u and v. To each point of the z-plane there corresponds one point on 
the w-plane. As z covers all points on some curve L x or within some 
region R x of the z-plane, the points w cover corresponding points on 
some curve L 2 or some region R 2 of the w-plane. The functional rela¬ 
tionship w = /(z) is said to map or to transform the curve L lt or the 
region R lt into the curve L 2 , or the region R 2 , and is called a mapping 
or transformation function. 

Example (i) Consider the mapping function w = z 2 and the imaginary 
axis x = 0, 0 < y < oo. 

Solution The curve L u the line x = 0, 0 < y < oo, is mapped into the 
curve L 2 given by 

u + iv = (x 2 — y 2 ) + 2 ixy, with x = 0, 0 <y < oo 
or 

u = —y 2 ; v — 0; 0<y<+oo. 

Thus, as z moves outwards from the origin along the positive imaginary 
axis, w moves outwards along the negative real axis. 

Example (ii) Examine the region into which the first quadrant of the 
z-plane is mapped by the function w = z 2 . 

Solution Let z = re ,e and w = pe 10 , 
then pe f * = r 3 e 2ifl 



Figure 10.9 shows the region of the mapping, the first quadrant of the 
z-plane is specified by 0 < 6 < 7r/2, 0 < r < oo and the corresponding 
region of the w-plane by 0 < $ < it, 0 < p < oo. The region of the 
w-plane is the upper half of the w-plane. 

Example (iii) Find the region into which the semi-infinite strip 
0 < y < 77, — oo < x < 0 of the z-plane is mapped by the mapping 
function w = e z . 
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Solution In this case 

w = pe'* = e* = e z+ly 



In Fig. 10.10 the semi-infinite strip is represented by the letters A, B, 
C, D. ' 

Along AB; y = 0, — oo < x < 0 

hence <f> = 0, 0 < p < 1, 

which is represented by A X B X in the w-plane. 



Fig. 10.10 

Along BC; x = 0, 0 < j < ir 

hence p — 1, 0 < (f> < -rr, 

which is represented by the arc B&. 

Along CD; y — tt, 0 > x > — oo 

hence <f> = tt, 1 > p > 0 

which is represented by CiZ>i. 

At an interior point of the semi-infinite strip, x < 0 and hence 
p = e 1 < 1. 
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Thus the interior of the semi-infinite strip 0 < j < tt, — oo < x < 0 
is mapped into the region 0 < ^ < tt, 0 < /> < 1, i.e. into the interior 
of the upper half of the unit circle of the w-plane. It may be noted that 
the boundary ABCD is mapped into the boundary A 1 B 1 C 1 D 1 and that 
the points A and D representing points at infinity in the z-plane map 
into coincident points at the origin of the vv-plane. 

The subject of mapping has many applications in engineering 
wherever a two-dimensional flow problem exists, notably in the fields of 
electricity, electrostatics, fluid dynamics, and elasticity. 

10.8 DIFFERENTIATION OF A COMPLEX FUNCTION 

Suppose z = x + iy to be a complex variable represented by the point 
P of the Argand diagram and let z' be the value taken by z at an adjacent 
point Q as shown in Fig. 10.11. Consider the limiting value of the ratio 

(46) 

as z' ->■ z, where /(z) is a function of the complex variable z. 



There is no restriction on the way in which z' approaches z, that is, 
no restriction on the path taken by Q as it approaches towards the 
point P (a number of different paths are shown in Fig. 10.11) and it 
would be expected that the limit of (46) (assuming it to exist) would be 
different for different paths. In general this is true but it is a remarkable 
fact that there exists a class of functions for which the limit of (46) is 
unique, i.e. is the same for all paths taken from Q to P. Such functions 
are called analytic functions and it is not altogether surprising that the 
theory of analytic functions has been extensively studied and has many 
applications in both theoretical and practical engineering. 

10.8.1 The derivative of f(z ) 

When the limit of (46) exists and is unique it is denoted by f(z) or 
df/dz and is called the derivative of/(z) with respect to z. 
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Thus, 


where Az = z' — z. 

Certain conditions must be satisfied by /(z) in order that the limit in 
equation (47) is unique. Let /(z) = u(x,y) + iv(x,y), then equation 
(47) may be written 

<«> 

Ay-»0 

It is possible to examine the limiting process as z' -> z in a general 
manner but it is sufficient to consider two particular cases. For 
the first let Ay = 0 (i.e. let z'-z be real) and for the second let 
Ax = 0 (i.e. let z' — z be imaginary). When Ay — 0, equation (48) 
gives 




v(x + A x,y) - v(x, y) 1 


du . dv 
S dx + l 8x 


(49) 


and when Ax = 0, equation (48) gives 


f\z) — Lim 


[u(*,y + Aj) - u(x,yf \ 


iAy 


1 8u dv 
~ldy + 8y 


+ Lim [— — 

a»-o L Ay 


Kx,J> + Ay) - v(x, y) 1 


m = 


dv .du 
dy 1 8y 


(50) 


it being assumed that the partial derivatives exist. The condition that 
f\z) has the same value in equations (49) and (50) leads to 
du dv du dv 


dx ' 


dy 


dx 


(51) 


showing that u and v, the real and imaginary parts of/(z), are connected 
by two partial differential equations. These equations are called the 
Cauchy- Riemann equations. 
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Because the equations (51) have been obtained by considering two 
particular limiting processes it is only to be expected that these are 
necessary conditions, i.e. if a function is analytic then the conditions 
(51) are satisfied. However, it may be stated, without proof, that pro¬ 
viding the four partial derivatives exist throughout a region and are 
continuous at all points of the region then the conditions (51) are also 
sufficient, i.e. if the continuous derivatives satisfy (51) then the function 
f(z) is analytic. 


Example (i) If/(z) = z 2 then u = x 2 — y 2 , v = 2 xy and 


du 

Ty~ 


35 - 2 * 


The derivatives exist and are continuous everywhere, the Cauchy- 
Riemann conditions are satisfied everywhere and hence f(z) = z 2 is 
analytic everywhere. 


Example (ii) If/(z) = 1/z then u = x/(x 2 + y 2 ), v = — y/(x 2 + y 2 ) and 
du y 2 — x 2 dv y 2 — x 2 

dx = {x 2 + y 2 ) 2 ’ dy~(x 2 +y 2 ) 2 
du —2 xy dv +2xy 

Ty = {x 2 + y 2 ) 2 ' Tx~ {x 2 + y 2 ) 2 

The derivatives exist and are continuous everywhere except at the point 
x = 0 = y. The Cauchy-Riemann conditions are satisfied everywhere 
except at this point. The function 1/z is therefore analytic everywhere 
except at the point z = 0. 


Example (iii) If /(z) = 


1 — x — iy then u = x. 


du_ 
dy = 


—y and 


The derivatives exist and are continuous everywhere but the Cauchy- 
Riemann conditions are not true anywhere. The function 2 is not 
analytic. 


Example (iv) If/(z) = |z| 2 = x 2 + y 2 then u = x 2 + y 2 , v = 0 and 
du dv n 

dx~ 2x ’ Sy~° 


Ty- 2 ?’ te = 0 - 
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The derivatives exist and are continuous everywhere but the Cauchy- 
Riemann conditions are satisfied only at the origin * = 0 = y. The 
function |z| 2 is analytic only at the point z = 0. 


10.8.2 Harmonic functions 

If a function /(z) is analytic within a region it is an important fact that 
f(z) then has derivatives of all orders. The proof of this statement is 
beyond the scope of this book but, in particular, the second order 
derivatives exist and hence from (51), 


and similarly 


8 2 u 


8 2 v 8 2 v 

8x 8y ~ 8y 8x 


8fu 8fu 
8x 2 + 8y 2 


0 


Pv Pv 
8x 2 + 8y 2 


= 0. 


The functions u and v and hence u + iv 
order partial differential equation 


8 2 V 8 2 V 

8x 2 + 8y 2 


= 0. 


8fu 
' 8y 2 

(52) 

(53) 

solutions of the second 

(54) 


The equation (54) is the two-dimensional form of a famous equation 
called Laplace's equation and any solution is called an harmonic function. 
When u + iv is analytic the two functions u and v are called harmonic 
conjugate functions (note that this is a different use of the word conju¬ 
gate to that used in section 10.2.3). 

Many problems in engineering can be reduced to the mathematical 
problem of finding a function that satisfies the two-dimensional 
Laplace equation within some region and satisfies certain conditions 
on the boundary of the region. In view of equations (52) and (53) it 
may be anticipated that the solution of such problems can be facilitated 
by the use of complex variable analysis. 

Using the notation of Section 9.12 


_ r , / 8 , 8 \ /8V. 8V \ 


_ 8fV 8 2 V 
~ 8x 2 + 8f 

Following the usual vector notation V . V is written F 2 and hence 
Laplace’s equation has the alternative operational form 


V 2 V = 0 . 


( 55 ) 
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Example (i) Show that the function u — cosh 2x cos 2 y is harmonic. 
Use the Cauchy-Riemann equations to deduce the harmonic conjugate 
function v that makes w(x, y) = u(x,y) + iv(x,y) an analytic function 
and express w as a function of the complex variable z — x + iy. 

Solution Differentiating, 
d^u 

— = 4 cosh 2x cos 2y; ^ = —4 cosh 2x cos 2 y 

8 2 u (P'u 

and hence V 2 u = -x-% + -x-s = 0 and u is an harmonic function. Now, 
8x 2 8y 2 

from the first equation (51) 

8v 8u .... _ 

— = — — 2 sinh 2x cos 2y 
8y 8x 

and v — sinh 2x sin 2 \y + g(x) 

where g(x) is an arbitrary function of x only. The second equation (51), 

8v 8u 

■x- — — a~ leads to 

8x 8y 

2 cosh 2 jc sin 2 y + g'(x) = 2 cosh 2x sin 2 y 
and it follows that 

g\x) = 0 

i.e. g(x) = C = constant. 

Thus v = sinh 2x sin 2 y + C 

where C is an arbitrary constant. 

The analytic function w = u + iv is given by 

w = cosh 2x cos 2 y + i sinh 2x sin 2 y + iC 
= cosh (2x + i2y) + iC 
i.e. w = cosh (2 z) + iC 

Example (ii) Show that u — y 3 — 2y — 3jc 2 y and v = x 3 + 2 Xj- 3 xy 2 
+ C are harmonic conjugate functions. 

Express w = u + iv in the form f(z) where z = x + iy and find 
Cif/(0) = 1. 

Solution Differentiating, 

8 2 ul8x 2 = -6y; 8 2 u/8y 2 = 6 y 

and 8 2 v/8x 2 = 6x; 8 2 vldy 2 = —6x ^ ^ ■ 

and hence V 2 u = 0, V 2 v = 0, and u, v, are harmonic functions. 

Also dujdx = —6xy — 8v/8y 
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and 

du/dy = 3/ - 2 - 3x 2 = -(-3/ + 2 + 3x 2 ) = -dv/dx 
and u, v, are harmonic conjugate functions. 

Further, w = u + iv 

= y 3 - 2y - 3 x 2 y + ix 3 + 2ix - 3ixy 2 + iC 

=/(* + iy). 

Setting y = 0 in the last result gives 
f(x) = ix 3 + 2ix + iC 
and so /(z) = iz 3 + 2 iz + iC. 

With /(0) = 1, then iC = 1, i.e. C = —i, and finally 

VP =/(z) = i(z 3 + 2z) + 1. 

EXERCISE 10 

1 Express the following complex numbers in the form a + ib, where 
a and b are real: 

(i) Kc + id), (ii) (1 + i)(2 - 30, 

(iii) (1 + 20/(1 - 0, (iv) (1 + 07(1 - 0, 

(v) 1/(1 + 207 (vi) (1 + 0(3 + 20/(2 - 0- 

2 Given that z = x + iy, find the value, in terms of x and y, and in the 
form a + ib, of (i) z 4 ; (ii) (1 + z)/(l — z); (iii) z + l/2z. 

3 In the relationship (R + ipL)(S — i/pC) = P/Q all the quantities 
are real except i. Show that p = y/(R/LSC) and find R in terms of 
C, L, P, Q, and S. 

4 Express in the form re w , giving the principal value of 0: 

(i) +3; (ii) -3; (iii) 1 + f; (iv) i-l; (v) V3/2 — f/2; 

(vi) — 8 (z'\/ 3 - 1); (vii) (3 + 0/(2f-l); (viii) (l+O 7 8 9 10 * * ; 
(ix) (1 + e tw/3 )(l + e 3i7r;2 ). 

5 Simplify 2/(cos w/4 + z sin tt/4); (V3 - 0 15 - 

6 Solve the equation (x 2 + l) 2 = 4(2x — 1) given that 2/ — 1 is one 
root. 

7 Find all the solutions of z e + 1 = 0 and indicate their positions on 
the Argand diagram. 

8 Find the cube roots of 8(z'\/3 — 1). 

9 Solve the equation z 5 + 1 = 0 and hence deduce that 

Z 5 + 1 = (z + l)(z 2 — 2z cos 7r/5 + l)(z 2 — 2z cos 3tt/5 + 1). 

10 Use de Moivre’s theorem to show that 

cos 50 = 16 cos 5 0 — 20 cos 3 0 + 5 cos 0 

sin 50 = 16 sin 5 0 — 20 sin 3 0 + 5 sin 0. 
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11 Writing z = cos 0 + i sin 0 show that 2 cos nd = z n + z" n and 
2 i sin nd = z n — z ~ n where n is any integer. By expanding 
(z — 1 /z) 4 (z + 1/z) 3 deduce that 

2 e sin 4 0 cos 3 0 = cos 70 — cos 50 — 3 cos 30 + 3 cos 0. 

12 Derive the following identities from the equivalent trigonometrical 
identities 

cosh 2 <j> — sinh 2 <j> = 1, 
cosh 2 </> + sinh 2 <j> = cosh 2 <f> 


= 1+2 sinh 2 <£, 

tanh 2<f> = 2 tanh </>/(l + tanh 2 </>). 

13 Given that 

cos (x + iy) = cos 0 + i sin 0 
where x, y, and 0 are real prove that 

cos 2x + cosh 2 y = 2. 

14 Show that the solutions of cos z = — 3 are the solutions of the 
simultaneous equations 

cos x cosh y = — 3 
sin x sinh+ = 0. 

Hence show that the solutions are 

z = (2r + 1)77- ± i cosh" 1 3. 

15 Describe geometrically the path or region of the z-plane if 

(i) Re z = 2 (ii) Im (z + iz) = 0 (iii) 0 < arg z < tt/ 3 
(iv) |z| <1 (v) |z — 3| = 2 (vi) |z + 3/| > 4 

(vii) zz = 2 (viii) |z/(z — 1)| = \/2 
(ix) 1 < |z| < 2, it /4 < arg z < zr/2. 

16 Test whether the following functions are analytic and find f'(z) 
where possible, stating any restriction on z. 

(i) /(z) = x 2 + y 2 - 2ixy (ii) /(z) = 2x - 3 y + /(3x + 2y) 
(iii) /(z) = sin (1/z) (iv) /(z) = xe 2 . 

17 Write the analytic function /(z) in the form u(r, 0) + iv(r, 0) where 
z = re is , and from the Cauchy-Riemann equations in Cartesian 
form deduce the equations in polar form 

8u 1 8v 1 8u 8v 

8? = ~r 86 and ~r 86 = ~ Tr 

18 Determine value for a, b, c, and d if 

(x 2 + axy + by 1 ) + i(cx 2 + dxy + y 2 ) 
is analytic everywhere. 
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19 Investigate the paths into which the real and imaginary axes of the 
z-plane are mapped under the transformation w = (z + l) 2 - What 
region in the z-plane is transformed into the semicircle u 2 + v 2 < 1, 
v > 0 in the w-plane ? 

20 A, B, and C are the points 1, 2, (2 + i) respectively, in the z-plane. 
Find the paths in the w-plane which correspond to the straight lines 
AB, BC, and AC when w — 1 + 2/z. Sketch on the Argand dia¬ 
gram. (Hint. For AC, evaluate u + v and ( u — l) 2 + v 2 .) 

21 When w = e* show that the straight line x = ky, y > 0 in the 
z-plane is transformed into an equiangular spiral in the w-plane. 

22 Show that u = y 3 — 3 x 2 y is a harmonic function and use the 
Cauchy-Riemann equations to find the harmonic conjugate function 
v where w = u + iv is analytic. Express w as a function of z. 

23 Use the Cauchy-Riemann equations to show that if w = u + iv 
= /(z) is analytic then each member of the family of curves 
u(x, y) = constant cuts each member of the family of curves 
v(x, y) = constant orthogonally. Illustrate by sketching the families 
of curves given by the transformation 

(i) w = z 2 ; (ii) w = 1/z; (iii) z = sin w. 

24 Show that u = x + 3 x 2 y — y 3 is an harmonic function. Find the 
harmonic conjugate function v such that w = u + iv is analytic. 
Express w in the form /(z) where z = x + iy. 



11 

Ordinary Differential Equations 


11.0 INTRODUCTION 

An ordinary differential equation is a relation connecting one inde¬ 
pendent variable say x, one dependent variable say y, and differential 
coefficients of y with respect to x. Many relationships in engineering 
are most easily expressed in the form of a differential equation. For 
example dqjdt = —kq is the differential equation formed from the 
problem of a leaky condenser discharging at a rate proportional to 
the charge q. 

The order of a differential equation is the order of its highest deriva¬ 
tive. The degree of a differential equation is the power of its highest 
order derivative. For example 



is of order 2 and degree 2. 

The solution of a differential equation is obtained by integration. 
The equation is solved when y is expressed in terms of x by a relation¬ 
ship of the form y = /(x) or <f>(x,y) = 0. A differential equation of 
order n will in general have a solution involving n arbitrary constants. 
This solution is called the general solution. If boundary conditions 
are prescribed, these arbitrary constants can be specified. 

The differential equation 

% =/(*), or Dy=f(x) (1) 

has been solved, in Chapter 6, by integration, to give 

y=jf(x)dx + C. (2) 

Similarly the equation 

d n y 

d£ =f{x) ( 3 ) 

may be solved by integrating n times giving 

y = C 0 + C lX + C 2 x 2 + . . . + Cn-jX"- 1 + F n (x) (4) 

where C 0 , C u C 2 , . . ., C n _j are arbitrary constants and F n (x) is the 
result of integrating/(x) n times. 
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11.1 FIRST ORDER EQUATIONS 

11.1.1 One variable absent 

dy 

(i) If the equation involves dyjdx and * only, i.e. if ^ = f(x), then 
integrating both sides : v 

y=jf(x)dx + C. 
dy 

Example Solve jc 3 ^ = 2x 2 + 3. 

Solution Divide by x 3 , x ^ 0 

^_2 3 

dx X X 3 

3 

then y = 2 log |x| - + c > 


where C is an arbitrary constant. 

(ii) If the equation involves dyjdx and y only, i.e. dy/dx = g(y), 
g(y) # 0, then 

dx _1_ 

dy ~ g(y)’ 


and integrating both sides 

* = /£>-• 

dy 

Example Solve ^ = ay, a given non-zero constant. 
Solution Rearranging and integrating, 

,.r*+c 

J ay 


(5) 


= ^log \y\ + C 


or log | y\ = ax — aC 

and this is one form of solution of (5), where C is an arbitrary constant. 
Now 

y — QdX-aC 

= e ai e-° c 

or y = Ae ax , (6) 

where A = e~ aC is also an arbitrary constant. 
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When a logarithm appears in an integral it is often more convenient 
to write the arbitrary constant in logarithmic form. Thus 

ax = log \y\ + log B 
then e ax = By 

or y = Ae ax , A = 1/B. 

The solution y = Ac az is well known as the solution of equation (5), 
since e 1 is defined in Chapter 4 as the particular solution of dy/dx = y 
such that when x = 0, y = 1. 


11.1.2 Variables separable 

Equations of this type are of the form 

g(y)%=f(x) (7) 

or giy)dy = f(x) dx, 

i.e. the variables x and y may be separated. Then the complete solution 
is given by 

jg(y)dy = jf(x)dx+C. (8) 


Example (i) Solve tan x(dyjdx) = cot y. 


Solution Re-write the equation as tan / dy = cot x dx, then 


or 

or 


f sin v , f cosx , _ 

I —— dy = —— dx + C, 
J cosy J J sinx 


—log |cosy| = log Jsin x\ — log c, 
cos / sin x = c. 


where the arbitrary constant is taken in the most convenient form, 
C = —log c. 


Example (ii) A mass m falls vertically from rest from a distance h 
above the ground. Determine the velocity with which it hits the ground 
if the air resistance is proportional to the square of the velocity. 

Solution Let the distance x be measured as positive downwards. 
Velocity v = dxjdt. 


Acceleration = - 


dv dx dv 
dx dt~ V dx 


( 9 ) 


From Newton’s Second Law of motion for constant mass, 


mg — mkv 2 = mv 

where the constant of proportionality is mk. 
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Hence J - dv = j dx + C 

or -^log\g-kv 2 \=x + C. 

The initial condition x = 0, v = 0 determine C = (—log g)/2k so that 



Hence e 2fcI = g/(g — kv 2 ), 

and » 2 = | (1 - e -2fcI ) 

gives the velocity at distance x from the start. 

In particular, when x = h, 

r 2 = | (1 - e- 2fch ). 

Note that when h -*■ oo, v 2 -*■ gjk. That is, no matter from what height 
the mass falls, the velocity will never exceed Vig/k). 


11.1.3 Homogeneous equations 

These are equations of the form 



( 10 ) 


and are reduced to the form of equation (7) in which the variables are 
separable, by the substitution u = yjx. If y = ux, then 


du 

dx 


and substituting fory and dyjdx in equation (10), leads to the equation 


u + x^ =/(«). 


In this equation the variables separate. 

Example (i) Solve the differential equation 

Solution Write y — ux, dyjdx = u + x dujdx, then the equation 
becomes 

2 du 

x z -r = ux log u. 
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If x ^ 0, divide by x and separate the variables so that the equation 
now is 

du dx 
u log u ~ x' 

Integrating, log log \u\ = log |x| + log c, or log u = cx. That is, 
u = e“ and the solution of the equation is 

y = xe CI , 

where c is an arbitrary constant. 


Example (ii) Solve the differential equation 
dy _ 9 x + y 
dx~ x+y 


Solution Writing y = ux, dyjdx = u + x dujdx gives 


u 


du _ 9 + u 
dx ~ 1 + u 


and hence 


du _ 9 — u 2 
X dx~ 1 + u ‘ 

Separating the variables, 

Cdx r (i + u)du ri § * \ 

J x - J (3-«)(3 + «)-J\3-«“3 + «r M - 

Integrating, 

log x = —f log (3-u)-£ log (3 + u) + £ log c, 
where c is an arbitrary constant. Hence 

x 3 (3 - u)\ 3 + u) = c, 

or x 3 (3 - y/x)\3 + y/x ) = c, 

or (3x — y) 2 (3x + y) = c. 


Example (iii) Solve the differential equation 
dy 9x+y+ 2 
dx x+y+l' 

Solution This equation can be made homogeneous by changing the 
variables to X and Y where X = x + y + 1 and Y = 9x + y + 2 
Then dX = dx + dy, dY = 9dx + dy and 
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and the equation becomes 

dY 9X + Y 
dX~ X+ Y 

which is solved in example (ii). Hence its solution is 
(3X - Y) 2 (3X + Y) = c, 
and the required solution is 

(6x -2y- l) 2 (12x + 4y+ 5) = c, 
where c is an arbitrary constant. 


11.1.4 Exact equations 

An ordinary differential equation which can be written in the form 

M(x, y) dx + N(x, y) dy = dF(x, y), (11) 

such that the left side is an exact differential, is an exact equation. 
That is, there exists a function G(x,y) such that 


( 12 ) 


Hence (11) may be written 

dG 8G 

^dx + ^dy = dF(x,y), 


or 

dG = dF. 

Integrating, 

G(x,y) = F(x,y) + C. 

Differentiating equations (12) partially with respect to y and x respec¬ 
tively, leads to 

8 2 G dM 8 2 G _8N 
dydx ~ dy ~ 8x8y ~ dx’ 


and a necessary condition for exactness in (11) is 
8M 8N 
dy dx 


(13) 


That is (11) cannot be exact unless equation (13) is true. It can be 
proved that the equation is always exact when equation (13) is true, 
although the proof is beyond the scope of this book, and equation 
(13) is both a necessary and sufficient condition that (11) is exact. 
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Example (i) Solve y 2 dx + 2xy dy = e x dx 
Solution Re-write the equation as 

d(xy 2 ) = d(e x ). 


and integrating 

xy 2 = e x + C 

where C is an arbitrary constant. 

It can be proved that the left side of (11) can always be made exact 
by multiplication by a suitable integrating factor p(x, y), which has the 
property that 


It is not easy to determine p from this equation. However, certain 
combinations of differentials may be recognised. 

Example (ii) Solve x dy — y dx = xy 2 dx. 

Solution Divide by y 2 and transpose to give 
y dx — xdy 

7 

i.e. the integrating factor is 1 \y 2 so that 


—xdx, 




d\- 1 + xdx = 


Integration gives 

* + £ = 
2 

where C is an arbitrary constant. 


11.1.5 First order linear equations 

The differential equation is of the type 

% + P(x)y = Q(x) (14) 

First, let P{x) = a, a a given constant, so that 

Tx + °y = ew (i5) 

The left side is not an exact differential but can be made so by multiply¬ 
ing by an integrating factor. If Q(x) = 0, equation (15) has the solution 
y — Ae~ ax , or ye ax = A. i.e. d(ye ax ) = 0. This suggests that the left 
side of equation (15) can be made an exact differential by multiplication 
by e M . 
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Multiplying both sides of equation (15) by e“ (^0) gives 
e ai % + ™ a y = e«Q(x) 

or ^(yt-) = ^Q(x). 

Integrating, 

=Je“0(x) dx + C 

or y = e-“ x je aI Q(x)dx + Ce~ ax (16) 

where C is an arbitrary constant. 


Example Solve 


with q = 0 at t = 0, R, C, and V are given constants. Note that C is 
not an arbitrary constant. 


Solution The equation may be written 


d l + J- 
dt ^ RC 


V 

R 


Multiply by e ,/fiC to give 




Integrating, qe tlBC = VCc tlRC + k 

where k is an arbitrary constant. 

Hence q = VC + ke~ tlRC . 

The initial condition q = 0, t = 0 give the value of k = — VC and 
q = VC( 1 - e “ tlRC ). 


Secondly, consider the more general equation (14). This can be 
made exact by multiplication by an integrating factor z(x) to be 

determined. For let P(x) = dz/dx, or z = Jp(x) dx, then equation (14) 
becomes 


dy 

dx 


+ %y = Q(x). 
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Multiplication by e 2 gives 

^(e 2 y) = e 2 g(x), 
from which on integration, 

ey = Je 2 £(x) dx + C (17) 

where C is an arbitrary constant. 

Since z = jp(x)dx 

is known, y is given in terms of x. Hence, multiplying (14) by the 
integrating factor exp (J Pdx) makes the left side of (14) exactly in- 
tegrable. 


Example (i) Solve the differential equation 


dy 

xX °z x te+y 


2 log x. 


Solution Divide by x log x to bring the equation to the standard form 
of equation (14) 


, y = 2 

dx " r x log x x 


( 18 ) 


Here, P(x) = 1/x log x, $Pdx = log |log x| so that exp (JiWx) = log x. 
Multiply equation (18) by the integrating factor log x and find 


, dy y 2 , 

lo g*T: + t = -l 0 g*> 


jj 0> log *) = flog*. 


Integrating, y log x = (log x) 2 + C 

and y = log x + C/ log x 

where C is an arbitrary constant. 


Example (ii) Solve 1 + j — = e -1 sec 2 x —• 

Solution Multiply by dyjdx to obtain the standard form of equation 
(14), 


dy 

fa + y = e 1 sec 2 x. 
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The integrating factor is e*, and hence 

It follows that ye x = tan x -f- C, 

or y = e _I tan x + Ce -1 

where C is an arbitrary constant. 

11.2 SOME SPECIAL TYPES OF SECOND ORDER 
EQUATIONS 

Some types of second order differential equations can be reduced to 
first order by a suitable change of variable. 


11.2.1 Equations with the dependent variable absent 

This type of equation involves d 2 yjdx 2 , dyjdx and * only. It is reduced 
to a first order equation by the substitutions 


dy_ _ <Py_ _ dp 

dx~ p ' dx 2 ~dx 


(19) 


so that the original equation becomes one involving dpjdx, p, and x, 
which is of the first order in p. This may be solved for p as a function 
of x and, by further integration, y obtained as a function of x. 


Example Solve the differential equation 


x 


dy 

dx 2 * dx 


= 0 . 


(20) 


Solution Using (19), equation (20) becomes 

sothat jj + Jf “°- 

Integrating leads to 

log \p\ + log |x| = log A, 
or p = A/x 

where A is an arbitrary constant. 

tt dy A 

Hence — = 

dx x 

and integrating once more, 

y = B + A log |*|. 

This is the solution of equation (20) where B is a second arbitrary 
constant. 
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11.2.2 Equations with the independent variable absent 

This type of equation involves d 2 y/dx 2 , dy/dx, and y only. The substitu¬ 
tions used are 


dy d 2 y _ dp _ dp dy dp 

dx P' dx 2 dx dy dx ^ dy 


( 21 ) 


The differential equation now involves p, dp/dy, and y and is of the 
first order in p. Its solution gives p in terms of y and a further integra¬ 
tion leads to y in terms of x. 


Example (i) Solve the differential equation 

y(d 2 y/dx 2 ) + (dy/dx) 2 = 0 (22) 


Solution The substitutions of (21) used in (22) give 
yipdp/dy) + p 2 = 0 
or dp/p + dy/y = 0. 


Integrating, 

p = My, 

where A is an arbitrary constant. Then 

p = dy/dx = A/y 

and y 2 = 2 Ax + B 

where B is another arbitrary constant. 


(23) 


An elementary application of this method is given by the solution 
of the equation of motion of a particle moving with simple harmonic 
motion as follows in the next example. 


Example (ii) A particle of mass m moves in a straight line under a 
force which is always directed towards a fixed point in the line and 
which is proportional to the distance of the particle from the fixed 
point. Examine the motion. 

Solution Let the fixed point be the origin O and the fixed straight line 
be the x-axis. Let the force per unit mass at distance x from O be n 2 x 
where n is some constant and let t be the time, then the equation of 
motion of the particle is 

—mn 2 x, 

—n 2 x. 


<Px _ 
dt 2 ~ 


(24) 
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Using substitutions corresponding to those in (21), i.e. 

dx d 2 x dp 

dt~ P ’ dt 2 ~ P dx 

equation (24) becomes 


Integrating, \p 2 + \n 2 x 2 = \a 2 n 2 , where \a 2 n 2 is chosen to be the 
arbitrary constant. Then p 2 = n 2 (a 2 — x 2 ) and 

P = § = ±"vV - x 2 ). 

Integrating once more, 

sin -1 (jc la) = ±(«/ + e), 
or x — ±a sin ( nt + e). 

Since a is arbitrary there is no need for the double ± sign and the 
general solution of equation (24) is 

x *= a sin (nt + e). 

Thus the motion is simple harmonic with period 2ir/n. The maximum 
displacement on either side of O is a, which is the amplitude of the 
motion. The angle e is the phase angle. 


11.3 LINEAR DIFFERENTIAL EQUATIONS WITH 
CONSTANT COEFFICIENTS 
These equations have the general form 


d n y 

a n -t— + a„_ 
n dx n 


* % ~ l y 


T + a n . 


d n ~ 2 y 


. + a 0 y= V(x), (25) 


where a 0 , a lt a 2 , ■ . a n are given real constants and V is a given 
function of x. 


11.3.1 The D-notation 

Let D be the operator d/dx so that Dy = dy/dx (see section 2.1.3.) then 
d 2 y d (dy\ 
dx 2 ~ dx \dx) 

is written D 2 y, and D m means the operator d m /dx m where m is a positive 
integer. Then, if m and n are positive integers, 

2Jm + n = D n +m = D m [D nj = D n [L>m] 
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The operator D is an example of a linear operator which is defined as 
follows. An operator <f> is a linear operator if 
(0 <i>(yi + Ja) = 4>{yi) + <f>(y-d and (26) 

(ii) <f>(ky) = k<f>(y), k constant. (27) 

It is simple to show that <f> = D satisfies these equations. Also, 
suppose y =y x and y = y 2 both satisfy the equation 

<Ky) = 0 , (28) 

then y = Ay x + By 2 also satisfies equation (28), where A and B are 
arbitrary constants. For, substituting y — Ay x + By 2 in equation (28), 

<!>{y) = <t>( A yi + fy 2 ) = tiAyJ + <j>(By 2 ) from (26) 

= ^(ji) + B<f>(y 2 ) from (27) 

= 0 , 

since both y 1 and y 2 satisfy equation (28). 

Similarly it may be shown that if y u y 2 , y 3 ,. . ., y n are all solutions of 
equation (28) then so also is 

y = Ayi + A sys + A 3 y 3 + ... + A„y n , 
where A lt A 2 , A 3 , . . ., A„ are arbitrary constants. 

An important property of linear operators is that they may be 
factorised. For example, 

which may be written as 

Writing the operator d/dx as D leads to 

(D*-D- 6)y = (D- 3 )(D + 2 )y, 
or, the operator D satisfies the equation 

D 2 - D -6 = (D- 3)(Z) + 2). 

Any polynomial P(D) in the linear operator D is itself a linear 
operator, for if 

P(D) = a n D n + fln-!/)*- 1 + . . . + a 1 D + a 0 (29) 
where a 0 , a u a 2 , . . ., a n are constants, then P(D) satisfies equations 
(26) and (27) since each term on the right of(29) satisfies these equations. 

11.3.2 The solution of differential equations with constant coefficients 

Equation (25) may now be written 

(a n D n + a n „ 1 D n ~ 1 + . . . + a x D + a 0 )y = K(x)l 
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where the linear operator <f> is P(D) as defined in equation (29). 

The technique for solving equation (30) is as follows: 

(i) let y c be the complete solution containing n arbitrary constants of 
the reduced or homogeneous equation 

<Ky) = 0; 

(ii) let y p be any particular integral of equation (30) so that 4>{y p ) = 
V(x). Then the complete solution of equation (30) is 

(31) 

since <f>(y) = <f>(y c + y v ) 

= <f>(yc) + <Ky t ) 

= 0 + V(x). 

The rest of this chapter is devoted to finding y c , called the comple¬ 
mentary function, and y p called the particular integral, for different 
forms of the function V(x). The complementary function contains the 
arbitrary constants of integration. The particular integral is any 
function, no matter how simple nor how found, which satisfies the 
differential equation. For example it is easy to see that the function 
y p = x is a particular integral of the differential equation. 

= (32) 

The complementary function is A sin (* + B) from the example (i) in 
section 11.2.2, and contains the two arbitrary constants. Hence the 
complete primitive of equation (32) is 

y = A sin (x + B) + x. (33) 

Expanding sin (x + B), 

y = A sin x cos B + A cos x sin B + x; 
and since A, B are arbitrary constants, so are A cos B and A sin B 
and the solution (33) may be written 

y = C sin x + E cos x + x (34) 

where C, E are arbitrary constants. 


11.4 THE COMPLEMENTARY FUNCTION 

Consider the solution of the second order differential equation 

(D 2 + aD + b)y = 0 (35) 

with a and b given real constants. This may be written 

(D - X)(D - p)y = 0, (36) 


where A and p may be real and distinct, real and equal, or complex in 
which case they are conjugate, (see section 10.5.3). To solve equation 


(36), let 


(D - p)y = u(x) 


(37) 
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where u is a function to be found. From equations (36) and (37), 
(2) — A)« = 0 and from equation (6) of section 11.1.1, u — Ae Ax , 
where A is an arbitrary constant. Substituting this value of u in equation 
(37), 

(D - ft)y = Ac Xl (38) 

and this is a first-order equation in y solved as in section 11.1.5. Multi¬ 
plying equation (38) by the integrating factor e - " 1 gives 
Diye-**) = Ae a ~ u)z , 

and integrating gives 

ye - " 1 = jAe ( *~ u)z dx + B 

or y = e“ r J J 4e a ~" >x dx + Bt ux (39) 

where A, B are arbitrary constants. This is the general solution of (36). 
Three cases arise, depending on the nature of the roots A, p of the 
equation 

D 2 + aD + b = 0 
Case (i) A, n real and distinct. 

Equation (39) becomes 

y - e “ x TZTp + B ^ x 

or y = Ce Al + Bc UI (40) 

with B and C arbitrary constants. 

Case (ii) A = p, roots real and equal. 

Equation (39) becomes 

y = e Xz jAdx 4- Be Xz 

or y = e Xz (Ax + 5). (41) 


Case (iii) A, y, complex. 

Let A = a + //?, n = a — //?. From equation (40), 
y = Ce Xx + Be uz , since the roots are distinct, or 

y = e az (E cos fix + F sin /Sx) (42) 

with E, F real arbitrary constants. 

Example (i) Solve the equation 

d 2 y c dy 



ORDINARY DIFFERENTIAL EQUATIONS 


289 


Solution Write as (D 2 — 5D + E)y = 0 
or (D - 2)(D - 3 )y = 0, 

and from equation (40) the solution is 

y = At 21 + Be 31 . 

where A and B are arbitrary constants. 

Example (ii) Solve (Z) 2 + 2D + 1 )y = 0. 

Solution The given equation may be written 
(D + 1 fy = 0, 
and from (41) the solution is 

y = (Ax + B)e>~ x 

where A and B are arbitrary constants. 

Example (iii) Solve (D 2 — 2D + 5)y = 0. 

Solution The quadratic in D will not factorise to give real factors. 
Since 

D 2 - 2D + 5 = (D - l) 2 + 4, 

the values of A and p are 1 ± 2i. From equation (42) the solution is 
y = e z (A cos 2 x + B sin 2 jc), 
where A and B are arbitrary constants. 

The method of solution of the «th order differential equation 
P(D)y = 0, where P(D) is a polynomial in D of degree n, is an 
extension of the method for the second order equation. 

Example (iv) Solve (D - 2)(D - 3) 2 (D 2 + 4 )y = 0. 

Solution The roots of P(D) = 0 are 2, 3, 3, ±2j, and 

y = Ae Zz 4- (Bx + C)e 31 + E cos 2x + Fsin 2x 

where A, B, C, E, and F are arbitrary constants of integration of the 
fifth order differential equation. 


Example (v) A particle of mass m moving in a straight line is acted on 
by a restoring force which is proportional to its displacement from a 
fixed point in the line and also by a resistance which is proportional to 
the velocity. Write down the equation of motion of the particle and 
solve it. 

Solution As in example (ii) of section 11.2.2, the equation of motion 
of the particle may be written in the form 


d 2 x dx 

n li 5=-mn z x-2mk w 
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where the last term represents the resistance proportional to the 
velocity and k is constant. Let D represent the operator d/dt, then the 
equation of motion may be written 

(Z) 2 + IkD + n 2 )x = 0. (43) 

The roots of the quadratic D 2 + 2 kD + « 2 = 0 are 

—k± V(k 2 — « 2 ). (44) 

Three possibilities must now be considered: 

(i) If k > n, V(k 2 — « 2 ) is real, and the roots are real, distinct, and 

both negative since k > \/(k 2 — n 2 ). Let them be —X and —p. The 

solution of equation (43) is then 

x = Ae~ u + Be-** (45) 

where A and B are arbitrary constants. 

(ii) if k — n the roots (44) are each equal to —k and the solution of 
equation (43) is 

x = e~ k XAt + B) (46) 

where A and B are arbitrary constants. 

(iii) if k < n the roots (44) are complex and may be written —k ± //J 
where /? = V(« a — * 2 ) > s real. The general solution of equation (43) 
in this case is 

x = ae~ kt sin ((it + a) (47) 

where a and a are arbitrary constants. 

In all three cases (45), (46), and (47) the value of x approaches zero 
as t -* oo because of the negative exponential factor in each. When 
the value of k, called the damping factor, is equal to n, the rate of 
decay of x is greatest. In this case the motion is said to be critically 
damped. When k > n the motion is heavily damped or overdamped. 
When k < n the motion is oscillatory but the factor t~ kt damps the 
amplitude of the vibration which dies away as t -*■ oo, and the period 
is increased. 

Certain measuring instruments are constructed so that the motion 
of the pointer is critically damped in order that the pointer shall 
return to its equilibrium position as quickly as possible. Sketches of 
three possible distance-time graphs are shown in Fig. 11.1. 

Damped oscillations are important also in electricity where the 
differential equation for a simple oscillatory circuit may be shown to 
be of the form of equation (43). For let q be the charge on the plate of a 
condenser of capacity C which is connected through an inductance L 
and resistance R, as in Fig. 11.2, and let there be no external e.m.f. so 
that the algebraic sum of the voltage drops is zero. 

Let j be the current in the circuit then the two equations 

Lj t + Rj + ^, = 0 and j = dqjdt 
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give the second order equation 


dt 2 ^ L dt^ LC~ 


(48) 


This equation may be compared with equation (43) with q instead 
of x and where 2k = RjL, n 2 = 1/LC, so that 

(i) if Rj2L > 1 ]\/(LC) the charge is not oscillatory and its behaviour 
is sketched in Fig. 11.1 (a) and (b). 
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is y = y c + y p where y c is found as in Section 11.4 and y v is any 
particular integral which satisfies equation (49) and contains no 
arbitrary constants. The form of the particular integral depends on 
the form of V(x). In this section the particular integral corresponding 
to V(x ) as a polynomial in x, an exponential function or as a circular 
function will be investigated. 


11.5.1 V(x) a polynomial in x 

There are several methods for finding particular integrals, one of 
which consists in assuming a particular integral of the same form as 
V(x), and a second defines an inverse operator £> _1 . A third method, 
adopted here, consists in finding a particular integral of a set of equa¬ 
tions. This method is chosen as being the simplest to apply and the 
simplest to justify. It is explained by examples. 

Example-^ i) Find a particular integral of the equation 


(Z> 2 + 2D- 3 )y = 9x 3 . (50) 

Solution Differentiate both sides of equation (50) repeatedly until the 
right side is zero as follows: 

(. D 2 + 2D- 3 )Dy = 21 x 2 (51) 

(D 2 + 2D- 3 )D 2 y = 54* (52) 

( D 2 + 2D- 3 )D 3 y = 54 (53) 

(D 2 + 2D- 3)Z>V = 0. (54) 

From equation (54) a particular integral of the set of equations is 
given by D*y = 0, from which it follows that 

D n+i y = 0, 0. (55) 


With this particular integral, substituting from equation (55) in equa¬ 
tion (53), from equation (55) and (53) in equation (52) and so on, the 
following results are obtained: 
from equation (53), D s y + 2 D*y — 3 D 3 y = 54, 
i.e. D 3 y = —18; (56) 

similarly, using equations (55) and (56) in equation (52) leads to 

D 2 y = -18* _ 12; (57) 

and using equations (56) and (57) in equation (51) gives 

Dy = — 9* 2 - 12* - 14. (58) 

Finally, using equations (57) and (58) in equation (50) gives 
-18* - 12 - 18* 2 - 24* - 28 - 3 y = 9x 3 , 
y p = — 3* 3 - 6* 2 - 14* - 40/3. 


or 
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The complete solution of equation (50) is y c + y p where 
y c = Ae~ 3x + Be 1 


and A, B are arbitrary constants. 

Example (ii) Find a particular integral for the equation 


(D 2 + D)y = x 2 . 

(59) 

Solution Differentiate equation (59) repeatedly after having re-written 
it in the fora (Z ) + i)D y - (60) 

to give (D + \)D 2 y = 2x, 

(61) 

(D + 1 )D*y = 2, 

(62) 

(D + 1 )D*y = 0. 

(63) 

A particular integral is D n+i y = 0, n > 0. 



Substituting 

in equation (62) gives D 3 y = 2, 

in equation (61) gives 2 + D 2 y = 2x or D 2 y = 2x — 2, 

in equation (60) gives 2x — 2 + Dy = x 2 , or Dy = x 2 — 2x + 2, 

and this is a particular integral of equation (59), from which, by 
integration, 

y p = ^x 3 — x 2 + 2x. 

There is no need to add an arbitrary constant of integration as this 
would be absorbed in the arbitrary constant A of the complementary 
function which is y c = A + Bq~ x . 


Example (iii) Solve the dilferential equation 


(64) 


Solution This is a third order equation and the methods of the previous 
sections apply. The equation is 

(D + 1 )D 2 y = 2 + 4x 3 (65) 

from which the complementary function is 

y c = A + Bx + Ce -1 

where A, B, and C are arbitrary constants. To find the particular 
integral, differentiate equation (65) four times and obtain 
(D + 1 )D 3 y = 12x 2 
(D + 1 )D i y = 24* 

(D + 1 )D 5 y = 24 
(D + 1 )D 6 y = 0. 
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Hence a particular integral is D n+6 y = 0, n ^ 0. 

Substituting back, D 5 y = 24, 

D*y = 24x - 24, 

D 3 y = I2x 2 - 24x + 24, 

D 2 y = 2 + 4x 3 - I2x 2 + 24x - 24. 

Integrating twice, 

Dy — x i — 4x 3 + I2x 2 - 22x, 
y„ = }x 5 - x i + 4x 3 - llx 2 . 

The complete primitive of equation (64) is therefore 

y — y c + y„ = Ce~ x + A + £x — llx 2 + 4x 3 — x 4 + Jx 5 . 

11.5.2 V(x) of exponential form 

Consider the second order differential equation 

(D 2 + aD + b)y = e°“ (66) 

where a, b, and a are given constants. 

Since jD(e ai ) = ae ai and Z) 2 (e ai ) = a 2 e ai , 

(D 2 + aD + b)e ai = (a 2 + aa 4- b)e ax . (67) 

Hence 

(D 2 + aD + h){ a2 + e fla ._} = e- 

provided a 2 + aa + b ^ 0, and a particular integral of equation (66) 
is e“7(« z + ax + b) since this function satisfies equation (66). 

More generally if the differential equation is of order n, such that 
P(D) is a polynomial of degree n, the equation 
P(D)y = e ai 

has a particular integral 

y ’ = £) 

provided /’(a) # 0. 

Example Solve (D 2 — 3 D + 2 )y = 20 cosh 3x. (70) 

Solution Since 20 cosh 3x = 10e 31 + lOe" 3 *, there are two particular 
integrals corresponding to 10e 31 and 10“ 31 respectively. 

Now P(D) = D 2 — 3D + 2, so that when a = 3, P(3) — 2, and 
when a = —3, P{— 3) = 20. Hence using equation (69), 

y p = 10 e 3 72 + 10 e“ 3 720 . 

The complementary function derives from 

(- D - 1 )(D - 2)y = 0, 


( 68 ) 

(69) 
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and is y c = Ae x + Be 2x , so that the complete solution of equation (70) 
is 

y = Ae x + Be 21 + 5e 31 + ie~ 3x 
where A and B are arbitrary constants. 

Suppose now that P(ot) = 0. This means that the term e ai on the 
right of equation (68) occurs in the complementary function. It is 
necessary under this condition to use the theorem that 

P(D)(e kx y) = e kx P(D + k)y (71) 

where k is a constant. This is proved by induction as follows. Since 
P(D) is a polynomial in D it is only necessary to prove 

D n (e kx y) = c kz (D + k) n y. (72) 

Assume that, for some fixed arbitrary value of n, equation (72) is 
true, then 

D n '\c kx y) = DD n (e kx y) = D[c kx (D + k) n y] 

= e kx D(D + k) n y + ke kI (D + k) n y 
= e kx (D + k)\D + k)y 
= e x (D + k) n+1 y. 

That is, if equation (72) is true for some fixed arbitrary value of n 
then it is true for n + 1. When n = 1 equation (72) is true, since 

D(e kx y) = e kx Dy + ke kx y = e kx (D + k)y. 

Hence equation (72) is true for all positive integers n > 1, and thus 
equation (71) is true for all polynomials P(D). 

Let the differential equation be 

P{D)y = e«* (73) 

where a is a given constant and P(a) = 0. Write y = e ai «, where u is a 
function of x to be found, and equation (73) becomes 
P(D)(e« x u) = e“. 

Using the result (71) with k = « leads to 

e“P(Z) + a )u = e°“. 

Dividing by e ai (^0) gives 

P(D + a )u = 1, 

and u is found using the method of section 11.5.1. 

Example Solve (D — 2)(D + l)y = e 21 . (74) 

Solution The complementary function is 

y c = Ae 2x + Be~ x , 
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where A and B are arbitrary constants. To find the particular integral, 
since P( 2) = 0, write y„ = e 2l u in equation (74), then 

(D - 2)(Z> + l)(e 2l w) = e 2x 
becomes e 2x (Z> - 2 + 2)(Z> + 1+2 )u = e 2 *, 

or (Z> + 3 )Dh = 1. 

Differentiating, a particular integral is D 2 u = 0 and all higher deriva¬ 
tives are zero, and u = $x. Hence y p = }xe 2x and the complete solution 
of equation (74) is 

y = U + ix)e 21 + 

11.5.3 V(x) is cos to a: or sin u>x 

Since e iax = cos a>x + i sin owe, it is necessary to solve P(D)y — e ta * 
using the methods of section 11.5.2, choosing the real part of the 
solution if V(x) = cos u>x and the imaginary part if V(x) = sin a>x. 
For if the equation is P(D)y = R^x) + iR 2 (x), the solution is y = 
yi + iy 2 where P(D) yi = R^x) and P{D)y 2 = R 2 (x), since 

p (~ D ){y i + '> 2 } = P{D)y x + iP(D)y 2 = R^x) + iR 2 {x). 

In particular, if R 2 (x) is sin tax, the solution is Im(y). 

Example (i) Solve (D + \) 2 (D + 2 )y = cos 3x. 

Solution The complementary function is 

y c = (A + Bx)e~* + Ce~ 2x 
where A, B and C are arbitrary constants. 

For the particular integral, consider 

(D + 1 )\D + 2 )y = e 31 *. 

Now P(D) = (D+ 1 ) 2 (D + 2) so that 

P(3i) = (3 i + l) 2 (3i + 2) 

= -2(17 + 6i). 

Hence ,, = Re [-^ - - e»»] 

= i 1?2 (cos 3x + i sin 3x)J 
= — 2 cos 3x + ~ sin 3xj • 

This may be written cos (3x - 0) where cos 0 = 17/325 and 

sin 0 = 6/325. The complete solution is therefore 

y = (A + Bx)e~* + Ce~ 2x - \ cos ( 3x + 0) 
where 0 is defined above. 
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Example (ii) Solve (D 2 — 2D + 2)y = e 1 sin x. (75) 

Solution The complementary function is obtained from the roots of 
D 2 — 2D + 2 = 0, or (Z> — l) 2 + 1 = 0, and from case (iii) of 
section 11.4, with A = 1 + /, p = 1 — i, 

y c = e x (A cos x + B sin x). 

To find the particular integral consider 

(D 2 -2 D + 2)y = eV 1 = e (1 + 0l . (76) 

With the notation of section 11.5.2, a = 1 + i, P(D) = (D — l) 2 + 1, 
P( a) = i 2 + 1 = 0. 

Hence writing y = e a + ,)x u(x) in equation (76) leads to 

(Z> + 2i)Du = 1, (77) 

and differentiating this once leads to 

(D + 2 i)D 2 u = 0, 

so that a particular integral of equation (77) is D n+2 u = 0, n > 0. 
Substituting back in equation (77) gives Du = 1/2/, and integration 
shows that 

u = x)2i = — \ix 

Thus y p = Im[—\ixe x e iz ] 

= Im[—\ixe z (cos x + i sin x)] 

= — Jxe 1 cos x. 

The complete solution of section (75) is therefore 

y = e 1 (A cos x + B sin x — | x cos x) 
where A and B are arbitrary constants. 

11.5.4 FWisxV” 

In this case it is necessary to write y = e ai «(x) and solve as in sections 
11.5.2 or 11.5.3. 

Example Solve d 2 yjdx 2 + n 2 y — x cos nx. (78) 

Solution Re-write equation (78) as 

(Z> 2 + n 2 )y = xe iBI . (79) 

The complementary function is 

y c = A cos nx + B sin nx, 

where A and B are arbitrary constants. To find the particular integral 


ORDINARY DIFFERENTIAL EQUATIONS 


299 


write y = c inx u in equation (79) and 

(D 2 + n 2 )(e lnz u) = xe lni 
becomes [(D + in) 2 + n 2 ]u = x 

or (D + 2in)Du = x. 

Hence (D + 2in)D 2 u = 1 

and (D + 2in)D 3 u = 0, 

and a particular integral is D n+3 u = 0, n ^ 0. Substituting back and 
integrating leads to the value of u as 

“ = “ 4 ^ + 4 ^' 

Then ^ + ^j(cos nx + i sin «x)J 

= 7-5 cos nx + 7 - sin nx. 

4n 2 4n 

The complete primitive of equation (78) is, therefore 

X x^ 

y = A cos nx + B sin nx + -r-= cos nx + 7- sin nx, 

4n 2 4n 

where A and B are arbitrary constants. 


11.6 APPLICATIONS 

Many vibrational problems in engineering such as occur in oscillating 
electrical networks, the bending of beams, torsional problems, the 
extension of springs, give rise to second order differential equations of 
the types solved in section 11.5. For example, if the electrical network 
of Fig. 11.2 has an applied voltage V(t), equation (48) is replaced by 

dt 2 ^ L dt^ LC L ' 

When V(t) is a polynomial in t, an exponential, or a circular function, 
this equation is solved by one of the methods of section 11.5. When 
V(t) is not of these forms further methods, not discussed here, may be 
used to find the solution. 

11.7 ISOCLINES. GRAPHICAL INTEGRATION OF dy/dx = f(x, y) 
Suppose the solution of the differential equation 

dy/dx =f(x,y) (80) 


is obtainable in the form 


y = <K X , c) 


( 81 ) 
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where c represents the constant of integration. The graph of y = c) 
for any given value of c is called an integral curve. Different values of c 
give the members of the family of integral curves. 

Example (i) Sketch the family of integral curves of the differential 
equation dy/dx — 3x 2 — 2x. 

Solution Here „ „ , 

y = x 3 - x 2 + c 

and the integral curves are cubic curves of the form 
y - c = x\x - l) 
and are shown in Fig. 11.3. 



Fig. 11.3 


Let such a family as (81) be represented by the curves as shown in 
Fig. 11.4. 

The Fig. 11.4 also shows other curves, such as AA', BB', and CC' 
which are the loci of those points, on the integral curves, that have a 
common slope dy/dx or <f>'(x, c ). Such curves are called isoclines and 
Fig. 11.4 indicates isoclines AA', and BB' for which <f>'(x, c) is zero and 
the isocline CC' for which <j>\x, c) is negative unity. The isoclines are 
curves whose equations are given by 


f(x,y) = m. 


(82) 
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In Fig. 11.4, AA', and BB' are isoclines with m = 0 and CC' is the 
isocline with m = — 1. 

However, it is frequently the case that the integral curves (81) are 
not obtainable (i.e. that the differential equation (80) cannot be inte¬ 
grated) but nevertheless the isoclines (82) can be drawn. If a sufficient 
number of isoclines are drawn, together with the line elements indicating 
the slopes of the integral curves that cross the isoclines, as in Fig. 11.5, 
it should be possible to construct, or at least to give an indication of, 
the shape of the integral curve passing through any given point. 

In fact it may not always be possible to sketch the isoclines and even 
if they can be drawn the interpretation of the form of the integral 
curves may be difficult. However it usually is possible to derive some 
indication of the behaviour of the solution to the differential equation 
in the neighbourhood of a point (x 0 , j 0 )- 

Example (ii) Use isoclines to indicate the form of the integral curves 
given that dy/dx = —xjy. 

Solution The isoclines are those curves of equation 
—xjy = m or y = (— ljm)x 


where m is the slope of the integral curves that cut the isoclines. 
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The isoclines are straight lines of slope (—1 jm) passing through the 
origin. Since (m)(— 1/m) = —1, the integral curves must cut the iso¬ 
clines at right angles. The Fig. 11.6 shows some of the isoclines and the 
associated slopes on the integral curves. 

In this case the integral curves are obviously a family of circles of 
equations x 2 + y 2 — c 2 . 

In Example (ii) the differential equation can be integrated by the 
method of section 11.1.2 to give 

jy dy = — jxdx 

i.e. \y 2 = —\x 2 + constant 

which is of the form x 2 + y 2 = c 2 . 

Example (iii) Draw the isoclines for the differential equation 
dyldx = x + y 2 . Construct the portion of the integral curves, for 
x > 0, passing through (a) the point (0, 0), (b) the point (1, 1). 
Solution The equations of the isoclines are given by 

x + y 2 = m 

where m is the slope of the integral curves where they cross the isoclines. 
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The equation 


J 2 = 


— x 


is a parabola whose axis is the x-axis and whose vertex is (m, 0). The 
family of isoclines is shown in Fig. 11.7. The Fig. 11.7 also shows the 
line elements, i.e. the portions of the integral curves where they cross 
the isoclines. The integral curve through (0, 0) has the shape of curve 
(a) and the integral curve through (1, 1) has the shape of curve (b). 
Clearly, if isoclines are drawn at closer intervals it should be possible 
to get a better approximation to the shapes of the integral curves about 
the two points chosen. The Fig. 11.7 also indicates that for x > 1 the 
solution y increases rapidly with increasing x. 


Example (iv) Sketch the isoclines and solution curves of 

dy 


Solution The isoclines are given by the equation 
mx + y = 1 /y 

i.e. yiy + mx) = 1 



Fig. 11.8 
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Comparison with YX = 1 shows that y{y + mx) — 1 is an hyperbola 
with asymptotes y = 0, and y = — mx. Re-writing in the form 
1 — y 2 = mxy shows that the branch of the hyperbola through 
(0, 1) is the one required. The isoclines and associated integral curves 
are shown in Fig. 11.8. 


EXERCISE II 

1 Solve the following differentia] equations: 


<.)5|-3,= 


(iii) x 2 (y 2 + 1) + yV(x 3 + 1 ) ^ = 0 ; 

(iv) V(2xy)^=l; 


(v) sin x + cosh 2 = 0 ; 

(vi) yV(2x* + 3)^ + xV(4 - y 2 ) = 0. 


2 Solve the differential equations 
(0 + e y ) sin x = 0 ; 

dy 

(») y-d~ x ~ x * = x * yl 

(iii) e y ^^ + 1 j = 1 , given that y = 1 when x = 0 ; 

dy 

(iv) (1 — x)y — — 4 — y 2 , and show that the particular curve of 

the system which passes through the origin has equation 
y 2 = 4x(2 — x). 


3 Solve the differential equations 

(i) (x + 1 ) ^ + 2y = Mx + 1 ) 2 ,y = 0 when x = 1 ; 


dy 

(ii) -T- + 2/tanx = 3; 


Y dy dy ’ 


dy 

(iii) * lo g * ^ + y = x log x; 
dy 

(v) cos x j- + y sin x = cos 2x. 
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4 (i) If dyldx + 2y tan x = sin x and y = 0 when x = 0, show that 
y — — 2 when x = tt. 

(ii) Solve the differential equation 

dy 

Tx +y 

(iii) Solve the differential equation 

dy 

5 Use the substitution y — ux to solve 

(i) x(x + .>0 % = y( x ~ y)l (u) 3x 2 ^ = x 2 + 2 y 2 ; 

(iii) * 2 % = y(2x + y); (iv) x a + f + 2 xy & = 0; 

(v) (x 2 -xy)^+y 2 = xy. 


6 Show that the following differential equations are exact and find 
their solutions; 


(i) (x +y)dx + (x + y 2 ) dy = 0; 

(ii) (2xe v + e 1 ) dx + (x 2 + l)e y dy = 0; 

(iii) (2 xy +y 2 )dx + (x 2 + 2x>> -y)dy = 0. 


7 Solve the differential equations 


(0 y 


d 2 y [dyY 
5x 2+ \dx) 


given that when x = 0, y = 1 and dyjdx — 0; 


wS+W®’-'. 


where k is a given constant. 

8 A particle moves in a straight line, its distance x from a fixed point 
of the line at time t is given by the equation 
d 2 x , (dx\ 2 
dt 2+k \dt) ~^ X ’ 

where k and y, are positive constants. If initially the particle is at 
rest at the point x 0 = 1/(2/:), obtain an expression for the velocity, 
and show that the particle moves with constant acceleration 
fi/(2k) in the direction of increasing x. 
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9 Obtain the complete primitive for each of the following: 

(i) (D 2 - 3D - 10)/ = 0; (ii) ( D 2 + 3 D)y = 0; 

(iii) (D 2 + 4 D + 4)/ = 0; (iv) ( D 2 - 4D + 13)/ = 0; 

(v) (D 4 + 4m 4 )/ = 0, where m is constant. 


10 Find the most general solution of the following equations such that 
(a) / = 0 when x — 0; (b) / = 1, dyjdx = 0 when x = 0. 


®£ + 4 + *- 0 ; 


dx 




11 Determine particular integrals for the equations 

(i) (D 2 + 4 D + 3)/ = x 2 ; (ii) (D 2 + D)y = 3x + 7. 

12 Obtain the complete primitives of the following equations 

(i) (D 2 + 2D + 5)/ = 4 + 2x; (ii) (D 2 -4D + 4 )/ = x 2 ; 

(iii) (5 D 2 -4D- 1)/ = 2x 2 ; (iv) (D 4 + 3 D 2 )y = x 2 . 

13 Determine particular integrals for the following equations: 

(i) (D 2 + 4D + 3)/ = e*; (ii) (D 2 + 4 D + 3)/ = 3e-^; 

(iii) (D 2 -2D+ 1)/ = e*; (iv) (Z) 2 + 5 D + 6)/ = xe" 1 ; 

(v) (D 2 + 4)/ = xV; (vi) (3D 2 - 10Z) + 3)/ = 2e 31 . 


14 Solve the following differential equations: 

(i )g + 2 | +5 ,-( 4 + 2 *)e-; 


<»S-i£+£—• 


rfx 2 ^ " 


15 Determine particular integrals for the differential equations: 

(i) (Z) 2 + 3 D)y = 5 cos x; (ii) (Z) 2 + 9)/ = 2 cos 3x; 

(iii) (D 2 - 2D + 1)/ = e 1 sin x; (iv) (D 2 - 2D + 2)/ = e 1 sin x; 

(v) (D 4 + 2D 2 + 1)/ = sin x. 

16 Solve the following differential equations 

(i) (D 2 + 9)/ = 4 cos 2x; (ii) (D + 4)/ = cos x; 

(iii) (D 2 + 5D + 6)/ = cos x + sin 2x. 

d^x dx 

17 (i) Solve — 4 + 4x = 0, given that x — 0, dxjdt = 2 when 

t = 0. 

d^x 

(ii) Solve -^2 + x = 3 sin 2t, given that x = dxjdt = 0 when 
f = 0. 
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18 (i) Solve the differential equation 
d 2 y dy 

J + «i + 9>-e- 


given that when x = 0,7 = 0 and dyjdx = — 1. 

(ii) Find the most general solution of the differential equation 
(D 2 — 3D — 4) 2 y = 0, such that dyjdx = 0 when x = 0 and the 
ratio y\x tends to zero as x tends to zero. 

19 Solve the equations 

(i) (D — l) 2 j = xe 1 , given that / = 1, Dy = \ when x = 0; 

d 2 y dy 

(ii) ^2 + 4^ + 3^ = 8e r , given that when x = 0, y = 0 and 
dyjdx = 4. 

20 Solve the differential equations 

d 2 y dy 

(i) + 3^ + 2y = 4e~*, given that y and dyjdx are zero when 

x is zero; 
d 2 y dy 

(ii) + 2 ^ + 10 y = sin 3x, given that y = 0, dyjdx = 1 when 
x = 0. 


21 Solve the differential equations 

(i) (D 2 - 2D + 1)7 = x 3 ; (ii) (D 2 - 10Z> + 29)7 
(iii) (9D 2 + 6D+ 1 )y = 0; (iv) (D 2 + 2 D+ 1)7 = 
(v) (D 2 + 3D + 2)7 = e*; (vi) (D 2 + 2 D + 5) a 7 = 


22 Integrate the differential equations 




47 = 2 cos 2x; 


(v) 


d^y 




57 = ^(e 1 sin x + x). 


e 1 cos 3 jc; 
+ l; 


23 Sketch the isoclines of the differential equation 
dyjdx = (x- y)jx 

for x > 0. Use these to sketch the solution curves for x > 0. 
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24 Use the method of section 11.1.3 to show that the general solution 
of the differential equation dy/dx = 1 + (y/x) is y — x log Ax where 
A is a constant. Sketch the isoclines of the differential equation, show 
the solution curve passing through (1, 0), and compare this with the 
actual solution curve through that point. 

25 Sketch the isoclines of the differential equation 

dy/dx = (x 2 + y 2 )/x 

and use them to sketch the solution curve near to the point (2, 0). 



12 

Analytical Properties of Algebraic 
Equations 


12.0 INTRODUCTION 

This chapter is concerned with some of the simpler analytical properties 
of algebraic equations particularly those of polynomial type. These 
properties are of use in subsequent chapters. 


12.1 THE REMAINDER THEOREM 

This states that if f(x) is a function possessing a Taylor series at the 
point x = a then the remainder on dividing by (x — a) is /(a). 

Proof 

The Taylor series for f(x) is 

/(*) = /(«) + (* - a)f'(a) + (x- a)T(a)/2\ + . . . 

= /(«) + {x - dyf>(x) 

where 

<£(*) =/'(«) + (*- a)r(a)l2\ + . . . 

Hence f(x)/(x - a) = </>(x) + /(a)/(x - a) 

in which <j>(x) is called the quotient and /(a) is called the remainder. 

In particular if f(x ) is a polynomial of degree n then is a poly¬ 
nomial of degree n — 1. 

Example (i) If x i — lx 3 + x — 2 is divided by (x + 2) the remainder is 
(-2 y - 7(—2) 3 + (-2) - 2 = 68. 


Example (ii) If x sin 2x is divided by (x — tt/ 4) the remainder is 
(77/4)(sin7r/2 ) = tt/4. 
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12.1.1 The factor theorem 

This states that if (x — a) is a factor of /(x) then f(a) is zero and is an 
immediate consequence of the remainder theorem. 

Example (i) Let f(x) be x 3 + 2x 2 — 5x — 6, then by trial 
/(2) = 8 + 8 - 10 - 6 = 0 
and (x — 2) is a factor of /(x). By inspection 

f(x) = (x - 2)(x 2 + 4x + 3) 

= (x - 2)(x + l)(x + 3). 

Example (ii) Let /(x) be 1 — sin x then by trial /(tt/ 2) = 0 and 
(x — 7 t/ 2) is a factor of 1 — sin x. But 
f(x) = 1 -sinx 

= /(W2) + (x - tt/2)/'(W2) + (x - W2)Y>/2)/2! + . . . 

= 0 + 0 + (x - 7 t/ 2) 2 /2 ! + . . . 
which shows that (x — n/2) is a double factor of 1 — sin x. 

12.2 SOLUTION OF EQUATIONS. MULTIPLE ROOTS 

It follows immediately from Taylor series and section 12.1 that the 
equation /(x) = 0 has a root at x = a if /(a) = 0. This is a simple 
root if f'(a) # 0; a double root if f\a ) = 0, f"(a) ^ 0; a triple root if 
f'(a) —f'ia) — 0,/'"(a) ^ 0; and so on. The condition that /(x) = 0 
has n equal roots x = a is that /(a) = f'(a) = f"(a) = ... = / <n - 1) (a) 
= 0 ,f (n Xa)=£0. 

If /(x) is a polynomial of order n then the equation /(x) = 0 can 
have at most n roots x = x lf x = x 2 , x = x 3 , . . ., x = x n because 
successive division of /(x) by the factors (x — x^, (x — x 2 ), . . . 
(x — x n ) gives a final quotient which is constant. On the other hand a 
general equation/(x) = 0 where /(x) has a Taylor expansion can have 
an infinity of roots. For example, the equation sin x = 0 has an infinity 
of roots given by x = nr, where r is an integer. 

12.3 DETACHED COEFFICIENTS. SYNTHETIC 
DIVISION 

It is often necessary to divide a polynomial by a linear expression 
(x — a) in order to examine the quotient and remainder. There are 
two ways in which the familiar long division process may be displayed. 
The first is by using detached coefficients in the long division process 
itself and the other is a condensed version of this display called synthetic 
division. To illustrate the methods consider the division of 3x 4 — 7x 3 + 
4x 2 - 3x - 5 by (x - 2). 
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12.3.1 Division using detached coefficients 

The reader should have no difficulty in following the long division 
process employed. 

1 -2) 3 -7 +4 -3 -5 (3 -1 +2 +1 
3 -6 

-1 +4 
-1 +2 


+2 -3 
+2 -4 


+ 1 -5 
+ 1 -2 


The quotient is 3x 3 - x 2 + 2x + 1 and the remainder is -3. This 
process starts by dividing 3x 4 by x to give the first coefficient 3 of the 
quotient, this coefficient is then multiplied by the coefficient -2 of the 
divisor and subtracted from the next coefficient —7 of the dividend to 
give the term —1 of the third row. This process is repeated until no 
further division is possible. 

12.3.2 Synthetic division 

The process of section 12.3.1 may be conveniently contracted by 
omitting the irrelevant detail and changing the sign of the coefficient 
—2 of the divisor so as to make additions instead of subtractions. 
The process is displayed below: 



3 -7 +4 

-3 

-5 

+2 

+6 -2 

+4 

+2 


3 -1 +2 

+ 1 

L=L 


The procedure is as follows: 

(i) write the dividend in detached coefficient form, any absent 
terms being shown by a zero coefficient, 

(ii) show the multiplier +2 (associated with the divisor, x — 2), 

(iii) carry the first coefficient of the dividend, i.e. 3, into the last row, 

(iv) multiply the first coefficient of the last row by +2 and place the 
product +6 in the second place of the second row, 

(v) add the two terms of the second column to give — 1 as the second 
term of the last row, 

(vi) multiply the second coefficient of the last row by the multiplier 
+2 and add to the third coefficient of the dividend, 



ANALYTICAL PROPERTIES OF ALGEBRAIC EQUATIONS 


313 


(vii) continue the process, 

(viii) the last term of the last row is the remainder and the preceding 
terms form the coefficients of the quotient. 

Example Use synthetic division to obtain the quotient and remainder 
when 3a 5 + 9x 4 — x 2 + 1 lx is divided bv (x + 3). 


Solution 

-3 

3 +9 
-9 

+0 

0 

-1 +11 

0 +3 

+0 

-42 



3 0 

0 

-1 +14 | 

-42 


The quotient is 3x 4 — x + 14 and the remainder is —42. Note the 
zero coefficients in the dividend corresponding to the coefficients of 
x 3 and constant term. 


12.3.3 Division of a polynomial by (ax — b) 
Let /(x) be a polynomial of degree n then 


/(*) = ir fix) 1 
ax — b a[x — 6/aJ 


= -<K*) + 


fm 


where <f>(x) is a polynomial of degree n — 1. 

The process is then seen to be as follows: 

(i) divide/(x) by the expression (x — b/a ), 

(ii) the coefficients of the quotient obtained are a times too large but 
the remainder term is correct. 


Example (i) Divide 21x 4 — 7x 3 + 3x 2 — lOx — 1 by (3x — 1). 


Solution 

21 

-7 

+7 

+3 

0 

-10 
+ 1 

-1 

-3 


21 

0 

+3 

-9 

|-4 


The quotient is 7x 3 + x — 3 and the remainder is —4. 
Example (ii) Divide 16x 4 + 24x 3 — 3x — 2 by (2x + 3). 


Solution 


16 +24 

+0 

-3 -2 


-f 

-24 

0 

0 +f 



16 0 

0 

-3 L+t_ 


The quotient is 8x 3 — I and the remainder is +f. 
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12.3.4 Repeated division 

Repeated division of a polynomial by a linear expression may be 
easily displayed by the synthetic division process. 


Example Divide 6x 5 + 80x 2 — 1 by (x + 2) 3 . 

Solution 16 0 0 + 80 0 — 1 

-2 -12 +24 -48 -64 +128 



6 -12 

+24 

+32 

-64 

+ 127 | 

-2 

-12 

+48 

-144 

+224 



6 -24 

+72 

-112 

+ 160 


-2 

-12 

+72 

-288 




6 -36 

+ 144 ^ 

-400 



Hence 


6x 5 + 80x 2 - 1 
(x + 2) 3 


= 6x 2 — 36x + 


144- 


400 160 

(x + 2) + (x + 2) 2 


+ 


127 

(x + -2) 3 


It may be noted that these terms are the partial fractions relevant to 
(x + 2) 3 . 

Continued division may be used to express the given polynomial as 
a polynomial in the linear divisor. For example, continuing the example 
above 


6 -36 

+ 144 

-12 

+96 

6 -48 

+240 

-12 


6 | -60 



from which it follows that 

6x 5 + 80x 2 - 1 = 6(x + 2) s - 60(x + 2) 4 + 240(x + 2) 3 

—400(x + 2) 2 + 160(x + 2) + 127. 

From the Taylor series it is evident that the coefficients +127, +160, 
—400, +240, —60, and +6 are the successive terms /(—2), /'(—2), 
2),/"'(—2),/ <4) (—2) and/ <5) (—2) respectively. 
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12.3.5 Division of a polynomial by a quadratic expression 

Synthetic division may be extended to include the division of a poly¬ 
nomial of degree greater than two by a quadratic expression ax 2 + 
bx + c. Consider the long division of 3x 4 + 5x 3 — 15x 2 + lOx — 1 
by x 2 + 2x — 3. In detached coefficient form this is 

1 + 2-3)3 +5 -15 +10 -1(3 -1 -4 
3 +6 -9 


-1 -6 +10 
-1 -2 +3 


-4 +7 -1 

-4 -8 +12 


+ 15 -13 


and the quotient is 3x 2 - x - 4 the remainder being (15x - 13). 
As in section 12.3.2 this process may be contracted by omitting un¬ 
necessary repeated coefficients and changing the signs of the coeffi¬ 
cients +2 and —3 in the divisor so that any subtraction is replaced 
by addition. The display is similar to that of synthetic division by a 
linear expression but an additional row of coefficients is required. The 
first row of coefficients is obtained from the dividend. The second row 
is obtained from the products of the multiplier —2 with the coefficients 
of the quotient and the third row consists of the products of the multi¬ 
plier +3 and the coefficients of the quotient. The last row consists of 
the coefficients of the quotient and remainder. The condensed array 
is shown below 


-2 
+ 3 


and the procedure is as follows: 

(i) carry the first coefficient of the dividend into the first place of the 
last row, 

(ii) multiply this coefficient by the multiplier —2 and enter the product 
in the second place of the second row. Multiply the same coefficient 
by the second multiplier +3 and enter the product in the third 
place of the third row. 
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(iii) add the elements of the second column and enter in the second 
place of the last row, 

(iv) repeat the process of (ii) and (iii) entering the products into the 
third and fourth places of the second and third rows respectively, 

(v) repeat the process until the last coefficient of the third row is 
found. Add the coefficients of the last two columns, 

(vi) the coefficients of the linear remainder are the last two coefficients 
of the last row. The coefficients of the quotient are the remaining 
ones of the last row. 

The procedure may easily be adapted to include a divisor with a 
coefficient of x 2 other than unity. As in section 12.3.3, the scheme is 
written for a divisor x 2 + (bja)x + (c/a), the remainder will then have 
the correct coefficients but coefficients of the quotient will be a times 
too large. 

Example Divide 3 jc 3 + 2x 2 — x + 5 by 3x 2 — x + 2. 

Solution The scheme becomes 

13 +2 -1 +5 1 



and the quotient is (x + 1), the remainder (— 2x + 3). 

12.4 RELATIONSHIPS BETWEEN THE ROOTS OF A 
POLYNOMIAL EQUATION 

Consider the roots of a cubic equation 

a 3 x 3 + a 2 x 2 + a ± x + a 0 — 0. 

Let the three roots be a. u a 2 , a 3 then 

a 3 x 3 + a 2 x 2 + a 3 x + a 0 == a 3 (x — «j) (x — a 2 )(x — a 3 ) 
and comparison of like coefficients leads to the following relationships 
+ «2 + «3 = —a 2 ja 3 'J 

a 2 <*3 + *3*1 + «i*2 = aja 3 j (1) 

ociocaocg = —a 0 ja 3 J 

The left sides of equations (1) are the sums of the products of the roots 
taken one at a time, two at a time, and three at a time respectively. 
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Example (i) If cl u a 2 , a 3 are the roots of 2a: 3 + 3 x 2 - 17x + 12 = 0 , 
then 


«1 + *2 + «3 = —3/2 
a 2 a 3 + <*3 a i + *i«2 = -17/2 

a l a 2«3 = —6. 

Extension of this result to the n roots of an nth degree polynomial 
equation 

a n x n + fln-iX"- 1 + a„_ 2 x B-2 + . . . + a& + a Q = 0 
leads to the relationships 

2a, = 

2a,a, = 

2a,a y a fc = 


a 1 a 2 a 3 . . . a„ = 

in which the left sides are the sums of the products of the roots taken 
one at a time, two at a time, three at a time, up to n at a time. 

It should be noted that the relationships do not in themselves solve 
the original equations. For example, elimination of a 2 and a 3 in 
equations (1) simply leads to 

«3*i 3 + tf 2 a i 2 + <*iai + a 0 =0. 

However, if some additional information about the roots is known, 
then these relationships may help to solve the equation. 

Example (ii) Find the roots of 8x 3 + 18x 2 — 17x + 3 = 0 given that 
one root is double one of the others. 

Solution Suppose the roots are a, 2a, and /? then, 

3a + 0=-18/8 (i) 

2a 2 + 3a/3 = -17/8 (ii) 

2a 2 /J = - 3/8 (iii). 

Eliminating /? between (i) and (ii) gives 

56a 2 + 54a - 17 = 0 

which leads to 

a = 1/4 or a = -17/14. 

From (i) /? = — 3 or (l = 39/28 and of the two possibilities a = f, 
P = —3 satisfies (iii) but a = —17/14, ft = 39/28 does not. 


+a n - 2 /a n i =£j J 
-a„_ 3 /a„ i =£j ^ k 


(- l) n a 0 /a B 
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12.4.1 Diminishing the roots of an equation 

The polynomial equation 

a n x n + an-i** -1 + • ■ . + a 0 = 0 
and the polynomial equation 

a n (x — k) n + h„_i(jc — Ar) n_1 + . . . + b 0 = 0 

where the b t are obtained, as in section 12.3.4, by dividing the first 
equation n times by (x — k ), where k is a constant, have the same 
roots x lt a 2 , a 3 , . . ., a„. The equation 

<*ny n + Vi/- 1 + b n - 2 y n -* + . . . + b 0 = 0 
therefore has roots (a x — k), (a 2 — k), . . ., (a„ — k). This process of 
forming a new equation whose roots are each k less than the original 
is called diminishing the roots by k. Similarly the roots can be increased 
by k. 

Example Diminish the roots of x 3 — 6x 2 + 9x — 4 = 0 by 2. 

Solution By section 12.3.4 

I 1 -6 +9 -4 1 

+2 +2 -8 +2 



and hence the roots of y 3 — 3y — 2 = 0 are each 2 less than those of 
the given equation. 

12.5 LOCATION OF THE ROOTS OF A POLYNOMIAL 
EQUATION WITH REAL COEFFICIENTS 

The process of root location is usually a trial and error one but there 
are a number of properties that may be used as aids to location and 
classification of roots. 

(a) If x 2 > and f(x 2 ),f(x 1 ) are of opposite signs then there is at 
least one root of f(x) = 0 in the range ^ < x < x 2 . This is obvious 
geometrically. 

Example (i) If f(x) = x 3 + 2x + 1, then, since /(—1) < 0 and 
/(0) > 0, there is at least one root between x = — 1 and x = 0. 

(b) If a and /J are successive real zeros of a polynomial /(x) then 
/'(x) has an odd number of real zeros in the interval a < x < p. 
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This property follows from Rolle’s theorem of section 5.1.1 because 
/(a) = 0 = /GS),/(x) is continuous and fix) exists. Stated differently 
the real roots of fix') = 0 are separated by the real roots of fix) = 0. 

Example (ii) If fix) = x 3 — lx + 2, then f'(x) = 3x 2 — 7 and the latter 
has two roots x = ±V(7/3). Since Lim f(x) < 0, /(— V(7/3)) > 0, 

/(+ V(7/3)) < 0 and Lim f(x) > 0 there are three changes of sign for 

fix) and it follows from (a) that /(x) = 0 has three real roots. One of 
these roots is in the range — oo < x < — \/(7/3), one in the range 
— a/(7/3) < x < + -\/(7/3), and one in the range V(7/3) < x < + oo. 

It may be stated that if fix) = 0 has (n — 1) unequal roots a!, a 2 , 
a 3) . . ., a n _! in ascending order and if the signs of the terms of the 
series /(— oo), /(aj, /(a 2 ), /(a 3 ), . . .,/(a„-i),/(+ oo) alternate then 
fix) — 0 has n unequal real roots. 

(c) Since complex roots occur in conjugate pairs (see section 10.5.3) 
then a polynomial of even degree has an even number (including zero) 
of real roots and a polynomial of odd degree has an odd number of 
real roots (i.e. at least one real root). 

(d) The Descartes’ rule of signs states that if fix) is a polynomial 
written in descending powers of x then 

(i) the number of positive real roots of fix) = 0 is not larger than the 
number of variations of f(x) and, 

(ii) the number of negative real roots is not larger than the number of 
variations of/(—x). 

In applying the rule, /(x) is said to have a variation if two successive 
terms have opposite signs. 

Example (iii) Show that x 5 — llx 3 + 2x 2 + 12 = 0 has two complex 
roots, two positive real roots, and one negative real root. 

Solution Since /(—oo) < 0, /(0) = 12 > 0, /(2) = — 36 < 0, and 
/(+ oo) > 0 then, by (a), there must be at least one negative real root 
and at least two positive real roots. Descartes’ rule of signs (d) states 
that f{x) = 0 has not more than two positive real roots and not more 
than one negative real root because /(x) has two variations and /(—x) 
has one variation. 

There are therefore exactly two positive real roots and one negative 
real root and hence two complex roots. 

(e) A rough graph is helpful in the location of the roots of an equa¬ 
tion f{x) = 0. For example, a sketch of the curve y = x 3 — 3x — 1 
from x = -2 to x = +2 will show that there is one root between 
—2 and —1, one root between —1 and 0, and one root between 1 
and 2. 

Alternatively a sketch of the two graphs of y = x 3 and y = 1 + 3x 
will show the three intersection points. 
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12.5.1 Procedure after location of a root 

If a root x = a of a polynomial equation is known or can be exactly 
located then the polynomial may be divided by (x — x) to give a 
quotient and a zero remainder (the evaluation of the latter serves as 
a check on the working). An attempt may then be made to locate 
the zeros of the quotient and hence the further roots of the original 
equation. 

It is likely that the roots of a polynomial cannot be found exactly 
and in this case an approximate value must be obtained. Methods of 
approximate root location will be discussed in the next chapter. 

EXERCISE 12 

1 Divide 3x 6 -x 5 + 4x 3 -l by (i) x - 2, and (ii) 3x - 1. 

2 Factorise the function 4x 4 — 6x 3 + 4x 2 — 3x + 1. 

3 Express 

(3x 7 - 36x 6 + 162x 5 - 324x 4 + 244x 3 - 13x 2 + 51x - 58)/(x - 3) 4 
as the sum of a quotient and partial fractions. 

4 Find the quotient and remainder when 4x 6 — 7x 4 + 24x 3 — 18x 2 + 13 
is divided (i) by x 2 - 2x + 3 and (ii) by 2x 2 + 3x - 1. 

5 Using the relationships between the roots find the equation whose 
roots are the squares of those of 3x 3 + 2x — 1 = 0. 

6 Solve the equation 12x 3 — 20x 2 + x + 3 = 0 given that the sum 
of two of the roots is 2. 

7 Show that 6y 3 + 19 y 2 + 1Q> — 11 = 0 is the equation obtained 
by diminishing by three the roots of 6x 3 — 35x 2 + 59x — 35 = 0. 
Show that there is a root y = \ and hence solve completely the 
equation in x. 

8 Show that fix) = 6x 4 — lx 3 — x — 6 = 0 has one negative real 
root, one positive real root, and two complex roots. 

9 An approximate root of x 3 + x 2 — 7x + 1 = 0 is x = 2-10 
correct to two decimal places. Divide the polynomial by (x — 2-10) 
and find approximations to the other two roots. 

10 Show that x 3 — 3x — 1 = 0 has three real roots, one between 
x = —2 and x = — 1, one between x = — 1 and x = 0, and one 
between x = 1 and x = 2. Show that the substitution x = r cos d 
and the identity cos 30 = 4 cos 3 0 — 3 cos 0 transforms the given 
equation to cos 3 0 — \ provided that r — 2. Solve this equation for 
0 and hence find all the roots of the given equation. 

11 Show that the method of Question 10 applied to the equation 
x 3 — 3x — 4 = 0 leads to roots x = 2 cos 0 where 0 is a solution 
of cos 36 = 2. Solve for 0 using the method of Question 14, Exer¬ 
cise 10 and hence find all the roots of the given equation. 
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Numerical Methods I 


13.0 INTRODUCTION 

The roots of an equation f(x) = 0, or the solution of a system of 
equations, may be found by either graphical or numerical processes. 
The former may be used if accuracy is not important, there being a 
limit to the accuracy with which a graph can be drawn and read. 
The numerical processes start with an approximation to the solution, 
often obtained from a rough graph, and then use a systematic scheme 
of numerical computation to improve the value. Such a process stops 
when the value is found to the required degree of accuracy. Any 
numerical process should, as far as possible, include some form of 
checking procedure so as to eliminate the propagation of errors. 
Among the numerical methods there is the iterative method. This 
starts with an approximation to the required value and by a given 
form of calculation produces a better approximation, which is then 
improved by the same form of calculation, the process terminating 
when two successive iterations give identical value to the degree of 
accuracy demanded (e.g. to the same number of decimal places). This 
method is the basis of most high speed computer programs, the 
computer performing a large number of iterations in a short space of 
time. It is possible for an iterative process to lead away from the 
required root, that is the process could diverge instead of converge. 
Under certain conditions the process may diverge from the required 
root but converge to some unwanted root. Usually it is necessary to 
ensure that certain criteria are satisfied by the equation or system 
of equations before commencing an iterative process. If these criteria 
are not satisfied then it may be necessary to employ an alternative 
process. 

13.1 GRAPHICAL METHODS 

When accuracy is not important, a root of the equation/(x) = 0 may 
be found by plotting the graph of the function y — fix') over some 
range of values a < x < b for which f(a) and f(b) are of opposite 
signs. The root is then found by reading the abscissa of the crossing 
point with the x-axis. This is a process that needs no elaboration. 

13.1.1 Intersection of two graphs 

Instead of drawing the graph of y — f(x) and finding the intersections 
with the x-axis it is often an advantage to write /(x) in the form 
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fix) — y(x) and then draw the graphs of the two functions/ = fix) and 
y = y>(x). The abscissae of the intersection points give the required 
roots of fix) = 0. 


Example The roots of the equation 1 + x 2 — 3 tan x = 0 are ob¬ 
tained from the intersections of / = 1 + x 2 and / = 3 tan x. These 
curves are easily drawn to any reasonable accuracy and the abscissae 
of the intersections points obtained. The curves are indicated in 
Fig. 13.1 and from this rough sketch the smallest root is found to be 
approximately 0-4. 



13.1.2 Repeated plotting on a larger scale 

An approximation x = x 0 to a root of f(x) = 0 may be improved by 
drawing the part of the graph y = fix), or of the parts of the graphs 
y = fix) and y = fix), near to x = x 0 to a larger scale and locating 
the root with additional accuracy. Instead of drawing the curves 
accurately it is usual to replace them by straight lines drawn through 
the end points of the interval about x = x 0 . This introduces an error 
into the new estimate of the root so that the process is repeated until a 
sufficiently accurate result is obtained. 
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Example Continuing the example of section 13.1.1 the following 
table may be constructed 

x <K X ) = 1 + x 2 yfx) — 3 tan x 


0-3 1 09 0-93 

04 116 1-27 

from which it is seen that the curves of y = <f>(x) andy = y(x) intersect 
between x = 0-3 and x = 04. Replacing the curves by straight lines 
through the end points of each curve as in Fig. 13.2 gives the improved 
value of the root as 0-36. 



An estimate of the error involved is found by calculating /(0-36). 
This is 0-0004, and the result 0-36 may be accurate enough to stop the 
process at this stage. Improved accuracy may be obtained by drawing 
the straight lines through the end points corresponding to x = 0-360 
and x = 0-361. This leads to the estimate x = 0-3601. 

At this stage it may be claimed that the result x = 0-360 is correct 
to three decimal places. 

13.2 NUMERICAL METHODS OF SOLVING f(x) = 0 
The graphical solution of section 13.1.2 can be represented numerically 
as an iterative process known as the rule of false position and this is 
discussed in section 13.2.1. In addition two other methods of iteration 
are considered. One is discussed in section 13.2.2 and the other, a 
simple but also a very powerful process, known as Newton’s method, 
is described in section 13.2.3. 

13.2.1 Rule of false position or method of linear interpolation 

Suppose a root of the equation /(x) = «£(x) - v ( x ) = 0 has been 
located between the values x 0 and x 0 + a (the smaller the value of a 
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the better), then reference to Fig. 13.3 shows that a better approxima¬ 
tion to the root will be given by x = x ± = x 0 + h where, using similar 
triangles, 


h 

a — h 


V(x q) - <f>(x 0 ) 

<K X 0 + a) - y>(x 0 + a) 


= g (say) 



The second approximation to the root is 



It may be noted that if ip(x) = 0 then <f>(x) = fix) and A, B are simply 
|/(x 0 )[ and j/(x 0 + a) j respectively. 

Example (i) Calculate the root of 1 + x 2 — 3 tan x = 0 that lies in 
the range 0 < x < 1. 

Solution Let <f>(x) = 1 + x 2 and y>(x) — 3 tan x, then it is easily 
seen that there is a root between 0 and 1. Suppose the first approxima¬ 
tion to be x 0 = 0, then the following table may be constructed and the 
next approximation found by an application of equation (1): 

x rp{x) <f> ~ V 


0 10 1 
1 2 4-67 -2-67 

With a = 1, A = 1, B = 2-67, then h = (1/3-67) x 1 
= 0-27 
— 0-3. 
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The second approximation is then 0-3. Proceeding as before: 

x </>(*) vKx) <f> - v 

0-3 1-090 0-928 +0-162 

0-4 M60 1-268 -0-108 

and here, a = 0-1, A = 0-162, B = 0-108, from which 
h = (162/270) x 0-1 = 0-06. 

The third approximation is 0-36, then 

X <K X ) V>( x ) <j> -y> 

0-360 1-129 60 1-129 21 +0-000 39 

0-361 1-130 32 1-132 64 -0-002 32 

and here, a = 0-001, A = 0-000 39, B = 0-002 32, and hence 
h = (39/271) x 0-001 = 0-000 1. 

The fourth approximation is 0-360 1. 

Note 1 The difference <f> — y> not only provides the values of A and B 
but the change in sign of the difference shows that there is a root 
between the two values of x used in the calculation. 

Note 2 With this method it is necessary to retain more decimal places 
in each successive calculation, five figure tables were used in the 
last approximation. 

Example (ii) Find the real root of 

fix) == x 3 — 6x 2 + 12x - 24 = 0 
to two decimal places. 

Solution Since /(4) = — 8 and /(5) = +11, there is a root between 
x ~ 4 and x = 5. Diminish the roots by 4 using the results of section 
12.4.1: 



then F(z) = z 3 + 6 z 2 + 12z — 8=0 has a root between 0 and 1. 
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Now successively apply (1) to F{z) starting from the value z 0 = 0. 
Then 

z F{z) = z 3 + 6z 2 + 12z — 8 


0 -8 
1 +11 

and here, a = 1, + = 8, B = 11, hence 

h = (8/19) x 1 = 0-4. 

The second approximation is z x = 0-4, then 
z F(z) 

0-4 -2176 

0-6 +1-576 

and here, a = 0-2, A = 2-176, B = 1-576, hence 
h = (2176/3752) X 0-2 = 0-12. 

The third approximation is z 2 = 0-52, then 

* _ m 

0-52 +0-003 0 

0-51 -0-186 7 

and here a = 0-01, A = 0-186 1, B = 0-003 0, and hence 
h = (1867/1897) x 0-01 = 0-009 8. 

(Note that the sign of F(z) when z = 0-52 shows that the third approxi¬ 
mation has overshot the root.) 

The fourth approximation is z 3 = 0-519(8). At this stage it may be 
claimed that the root is z = 0-52 correct to two decimal places. The 
root of fix) = 0 is then 4-52. 

In practice this method is seldom used after the second stage. A 
value having been found the approximation is improved by using a 
more powerful method such as the Newton-Raphson method of 
section 13.2.3. 

13.2.2 Another iterative method 

There is a useful iterative method available when fix) = 0 can be 
written in the form x = <f>ix). The iterative procedure is given by 

x n+1 = <£(*~) 

where x n is the nth iterate and the procedure ends when the required 
accuracy has been obtained in two successive iterations. 
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The figures 13.4(a) and (b) illustrate two convergent iterative processes 
* 0 , *!, x 3 , x 3 . . one converging directly to the root and the other 
oscillating about the root value. The Fig. 13.4(c) illustrates a process 
that diverges from the root value. It is obviously possible with this 
process to start with an approximation very close to the required 
value and then to calculate values that are further and further away 
from the true value. It is a necessary and sufficient condition for 
convergence that \<f>'(x)\ < 1 in the neighbourhood of the root. This 
criterion is satisfied for the curves of Fig. 13.4(a) and (b) but not for 
Fig. 13.4(c). 



(<) 

Fig. 13.4 


Example Find the root of the equation 

x 3 - 2x 2 + 3x — 1 = 0 

near to 0-5. 

Solution Write 

*« + i = i(l +2xl- *2) 
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and start from x 0 = 0. The iteration proceeds as follows: 


x o = 0 

x 5 = 0-430 

x x = 0-3 

x 6 = 0-430 1 

x 2 = 0-38 

x 7 = 0-430 14 

x 3 = 0-41 

x 8 = 0-430 16 

x 4 = 0-42 

x 9 = 0-430 160 


The root is therefore 0-430 2 correct to 4 decimal places. This procedure 
does not have rapid convergence but is very simple to apply. 

A major disadvantage of the method is that it may be possible to 
rearrange f(x) = 0 in a number of different ways so as to produce 
the form x = <f>(x) and while one (or more) of these arrangements 
can lead to a convergent process another may lead to a divergent one. 
For the purpose of illustration consider the equation 
x 2 — 4x + 3 = 0 

which has roots at x = 1 and x = 3. The iteration 
x n+1 = 4 —3 /x n ^Ux n ) 

for which |<£'(1)| > 1 and |</>j(3)| < 1, will be convergent near the 
root x = 3 but divergent near the root x = 1, i.e. it is impossible to 
locate the latter root using this iteration. However the iteration process 

*»+i = 3/(4 — x„) = <f>a(x n ) 

for which |^g(l)| < 1 and \<f>s(3)\ > 1 will converge to the root x = 1 
and diverge from the root x = 3. 

In employing this method care must be taken to see that the iteration 
is converging to the required root. A rough sketch is frequently an aid 
to the making of the correct choice of function <f>(x). 


13.2.3 The Newton-Raphson method 

Suppose that x 0 is an approximation to the root of the equation 
/(x) = 0 then the Taylor series expansion of/(x) in the neighbourhood 
of x 0 is given by 

fix) = fix 0 ) + (x - x 0 )/'(x 0 ) + (x - x 0 ) 2 + - • • (2) 

If Xj is a value of x nearer to the exact root, then (xj — x 0 ) will be 
small, /(Xi) is approximately zero, and providing that /"(x 0 ) is not 
too large the Taylor series may be replaced by 


0 =/(x 0 ) + (Xi - x 0 )/'(x 0 ) 


x 1 = x 0 — 


fix o) 

~/W 


from which 


(3) 
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This is the Newton-Raphson rule and repeating for successive approxi¬ 
mations x u x 2 , x 3 , . . . leads to the iterative relationship 


*n + l 


x n 


f(Xn) 

f'(Xn) 


(4) 


where x n is the «th iterate. The geometrical interpretation is shown in 
Fig. 13.5. 



The ordinate at A 0 is f(x 0 ) and, since tan 6 0 = f'(x 0 ), the tangent 
at A 0 intersects the x-axis in the point x x . The process illustrated in 
Fig. 13.5 is seen to be convergent. 

As a working rule the number of reliable decimal places in the 
quotient /(x n )//'(x„) is usually double the number of zeros between 
the decimal point and the first significant figure (i.e. the number of 
correct decimal places is approximately doubled at each iteration). 
The final answer is obtained when two successive iterates agree to the 
requisite number of decimal places. 

It is evident from Fig. 13.5 that the convergence will be rapid provid¬ 
ing that /'(x) is large near to the root but becomes slower as /'(x) 
approaches zero. In the latter case it becomes necessary to compute 
f(x n ) and f'(x n ) to high accuracy to obtain an accurate value of x B+1 . 

Under certain conditions the convergence may be too slow for hand 
computation or may even diverge. The difficulty is frequently overcome 
by plotting a rough curve to a larger scale near to the root or roots 
and examining the geometrical situation. Different cases of convergence 
are illustrated in Fig. 13.6. 
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In Fig. 13.6(a) the process is convergent, the iterations being succes¬ 
sively on either side of the root. In Fig. 13.6(b) the process converges 
to the root at R and neither of the roots at P or Q are located, in this 
case a starting approximation much nearer to P (or Q) will lead to a 
convergent process to P (or Q). In Fig. 13.6(c) the process is divergent 
starting from the approximation x 0 . As in the previous case the root 
can be located by a convergent process starting from an approximation 
nearer to the root itself. 




(c) 

Fig. 13.6 


Example (i) Find the smallest root of 1 + x 2 — 3 tan x = 0. 

Solution Write 

fix) — 1 + x 2 — 3 tan x 
then f'{x) = 2x — 3 sec 2 x. 

Take the first approximation to be x Q = 0-3 (being the second approxi¬ 
mation obtained by the rule of false position) then 
/(0-3) = 016 

/'(0-3) = -2-58, 

and X! = 0-3 - (0-16/—2-58) 

= 0-3 + 006 
= 0-36. 
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With Xx = 0-36, 

x 2 = 0-36 — /(0-36)//'(0-36) 

= 0-36 - 0-000 4/(—2-705 1) 

= 0-36 + 0-000 147 87 

i.e. x 2 = 0-360 148 ^ 0-360 15. 

Using five figure accuracy,/(0-360 15) is 0-000 08. The result is prob¬ 
ably correct to the fifth decimal place but a further iteration would 
be necessary to confirm this. 

Example (ii) Find the root of 

x 3 - 6x 2 + 12x - 24 = 0 

near to x = 4. 

Solution This is example (ii) of section 13.2.1 in which the rule of 
false position was used to show that x = 4-52 correct to two decimal 
places after three iterations. 

Let f{x) = x 3 - 6x 2 + 12x - 24 

and /'(x) = 3x 2 - 12x + 12 

= 3(x - 2) 2 

and start with x 0 = 4, then 

Xj = 4-0 - (-8/12) = 4-6 

x 2 = 4-6 - (1-576/20-28) = 4-6 - 0 077 8 = 4-52 (say) 
x 3 = 4-52 - (0-003 10/19-051 3) 

= 4-52 - 0 000 162 7 = 4-519 84 
x 4 = 4-519 84 - (-0-000 02/19-048 79) 

= 4-519 84 + 0 000 001 049 9 
= 4-519 841 049 9 

After four iterations the root is 4-519 84 correct to five decimal places. 

A useful application of the Newton-Raphson rule is to the calcula¬ 
tion of square roots. If x == then x is a root of/(x) = x 2 - a = 0 
and the rule then states that if x n is the nth iterate 

*+i = *»- (/(+.)//'(*»)) 

= x„ — (x 2 — a)l 2x„ 

i- e - *»+i = i(x „ + a/x n ). ( 5 ) 

Thus, if x n is an approximation to the square root of a number a then 
the mean of x n and the quotient a/x n is a better approximation. Each 
iteration is correct usually to twice as many decimal places as the 
preceding iteration. 
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Example (iii) Find \/5. 

Solution Take the first approximation as x 0 = 2, then 
xj. = i(2 + 2-5) = 2-25 
x 2 = K2-25 + 2-222 2) = 2-236 1 
x 3 = 4(2-236 1 + 2-236 035 96) = 2-236 067 98 
The result 2-236 1 is correct to four decimal places. 

Note 1 A major advantage of the Newton-Raphson method is that 
any error made is not disastrous providing that the process, starting 
from the incorrect value, remains convergent. 

Note 2 If f'(x) is large the rate of convergence is large and the denomina¬ 
tor term f'(x) may be conveniently replaced by a suitable constant. 
For instance, in example (ii) above, f'(x) could be replaced by 20, 
the successive iterates being x 0 = 4, x x = 4-4, x 2 = 4-509, x 3 = 
4-519 1. 

Note 3 A measure of the accuracy of the iterate x n is given by the value 
of/(x n ). This value is called the residual and usually denoted by R. 

Note 4 If the coefficients of the equation f(x) = 0 are known only to a 
certain accuracy then the approximations to the roots of the equation 
cannot be calculated with certainty to a greater accuracy. For 
instance if the coefficients are known to be correct only to two 
decimal places then there is no advantage in calculating the roots 
to many more decimal places. 

Note 5 If (a) a small change in the value of the coefficients of f(x) leads 
to a large change in the value of the roots, or 

(b) small values of the residual R can be obtained even when 
the values of x n are widely different from the true value of the root, 
then the equation f(x) = 0 is said to be ill-conditioned. Equations 
that are ill-conditioned are difficult to deal with by any method of 
numerical analysis. In the Newton-Raphson method it frequently 
means that fix) and f'{x) are both small and that the calculations in 
each iteration must be carried out with the retention of many more 
decimal places than can be claimed with accuracy in the result of the 
iteration. 

13.3 NUMERICAL SOLUTION OF LINEAR ALGEBRAIC 
SIMULTANEOUS EQUATIONS 

The elimination method of Gauss was used in section 8.6.1 to solve 
a set of linear simultaneous algebraic equations. The numbers given 
there were simple integers and the calculations were performed using 
only mental arithmetic. The matrix scheme can be used even when the 
elements are decimal but it is more convenient to draw up a computa¬ 
tional scheme and to introduce a system of checking. This scheme will 
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be discussed in section 13.3.1. As in section 13.2 it is possible to form 
an iterative process that will converge to the solution of the set of 
equations and two such methods are discussed in sections 13.3.2 and 
13.3.3. 

13.3.1 Gaussian elimination method 

Given a set of three equations in three unknowns x, y, and z: 

412x - 9-68 y + 2-01z = 4-931 
1 -88* - 4-62 y + 5-50z =311 (6) 

1-IOjc — 0-96/ + 2-72z = 4-02 J 

the basic process is first to eliminate x and y and form an equation 
in z only, second to use this to form an equation in y only and finally 
to form an equation in x only. In terms of matrix algebra this is equiva¬ 
lent to transforming the coefficient matrix into an equivalent triangular 
matrix. 

A simple check is introduced, and assuming no numerical mistakes 
are made (or left uncorrected) the errors in the final answer are due 
solely to rounding off errors in the multiplication and division used. 
Finally an estimate of the error may be formed and a correction 
applied. The computational scheme proceeds as follows: 



The first estimate of the solution is given by the b column of (6.12), 
(6-13), and (6.14). The columns marked R are columns of residuals but 
are not computed until after the first estimate is found. The columns 
y, z, and b together form the coefficients of the augmented matrix 
of the set of equations. The column marked a is the check column and 
the first three entries in this column are simply the row sums of the 
entries in the previous four columns. The elimination process proceeds 
by performing the operations as indicated in the last column, the aim 
being to eliminate the successive variables until the final coefficient 
matrix is in a triangular form. As an aid to computation it is useful 
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to reduce to unity the coefficient of the variable being eliminated. At 
each stage of the elimination the entry in the or column is computed 
by performing the operations upon the previous entries of that column. 
This entry is then checked against the sum of the other entries of the 
row. For example, in forming row (6.9) the entry <r 9 is given by o a -F 
1-625, i.e. 6-511/1-625 or 4-007. This entry is then checked against the 
sum of the other entries in that row, namely 4-006 which indicates 
that there is probably no major numerical error. 

The first estimate of the solution is 

= 4-210,^ = 1-333, and z x = 0-246 

with the third decimal place in doubt. The column of residuals R a) 
is now computed as follows. The first entry is obtained by inserting 
the estimates x u y x , and z x into the original equation ( 6 . 1 ) written in 
the form 


4-12*! - 9-68^! + 2 0lz i ~ 4 ' 93 - 

The second and third entries in the R (1> column are found in like 
manner from the original equations (6.2) and (6.3). The first three 
entries are 0-006, —0-001, and 0-000. Note that if the estimates had 
been exact then the residuals would all have been zero. The remaining 
entries in the R (1) column are computed by performing the appropriate 
operation on the previous entries. Finally, the entries b of (6.12), 
(6.13), and (6.14) are adjusted by subtracting the corresponding residual 
value. The second estimate of the solution is then 


x 2 = 4-210 - 0-002 1 = 4-207 9 
72 = 1-333 - 0-000 1 = 1-332 9 
z 2 = 0-246 + 0-000 8 = 0-246 8 

The first three entries in the second column of residual values may now 
be found by inserting these new estimates into the original equations. 
The new residuals are 0-000 1, 0-000 3, and 0-000 4 respectively. The 
rest of the column may then be computed and the third estimates 
found. 

The above method of solution solves the set of equations in one 
computational process and, providing the equations are not ill- 
conditioned and sufficient accuracy is maintained in all multiplications 
and divisions, the first estimate obtained should be very close to the 
correct answer. In contrast to this process it is possible to formulate 
an iterative process starting from a rough estimate of the solution and 
then gradually converging to the correct result by successive calcula¬ 
tions of the same kind. There are two basic methods in common use, 
both modifications of a method originally due to Gauss. 
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13.3.2 The Gauss-Jacobi method 

Suppose that a set of equations in three unknowns is given in the 
form 

flu* + a 12 y + a 13 z = b x \ 

a 21 x + a 22 y + a 23 z = b 2 J (7) 

a 31 x + a 32 y + a 33 z = b 3 J 

and that the equations are already in the form, or by suitable adjust¬ 
ment of the augmented matrix may be put into the form, in which the 
diagonal terms of the coefficient matrix are large compared with the 
other terms. Then the equations may be written 

* <r+1) = (bi - a 12 y lr) - a I 3 z (r) )/a u 'j 
y r +u _ (* 2 _ a 21 x {T) — a 23 z< ry )la 22 ( 8 ) 

z (r+1) = (b 3 - a 3 ix <r) - a 32 y ir) )/a 33 ) 
where x (r) , y (r) , and z <r> are the rth iterates. The procedure starts by 
assuming an estimate x <0) , / 0) , and z <0) , which, in the absence of any 
better estimate may be taken as 0,0, and 0. The second estimates may 
then be found from equations (8). The results are set out in table form 
and it is usually soon evident whether the process is convergent or not. 
It is possible for the process to be slowly convergent (as for example 
when the diagonal coefficients are not much larger than the other 
coefficients of the matrix) or even to diverge. 

For purposes of illustration of the method, consider the following 
example. 

Example Solve the set of equations 

8 x— y + z — 18 — 01 
2x + 5y-2z- 3 = 0. (9) 

x+ y-3z+ 6 = 0 j 

Solution The iterative scheme is applied to the equations 
x — (y — z + 18)/8 \ 

y = (~2x + 2z + 3)/5 . (10) 

z = (y + x+ 6)/3 j 

Then, 

x (r+1 > = 0125y< r) - 0-125z (r) + 2-250 \ 
y r+1) = —0-400x (r) + 0-400z (r) + 0-600 (11) 

z (r+1 > = 0-333j (r) + 0-333x (r) + 2-000 J 

and the iteration starts with x m = j <0) = z (0> = 0. Successive iterations 
are performed and the results are given in the table below 


Iteration 

0 

i 

2 

3 

4 1 

X 

0 

2-250 

2-07^ 

1-943 9 

2-011 4} 

y 

0 

0-600 

0-500 

0-949 6 

0-965 7 


0 

2-000 

2-949 

2-858 2 

2-964 4/ 
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The process is clearly convergent, the actual values being x = 2, 
y — 1, and z = 3. 

13.3.3 The Gauss-Seidel method 

This method is a modification of the Gauss-Jacobi method. The set 
of equations (7) are written in the form 

x (r+1) = (7>i - a 12 y m - a 13 z (T> )!a xl \ 

V< r+1) = (b 2 — fi 21 * (r + 1> — a 23 Z<r) )/ a 22 (13) 

Z ( r+ 1 ) = (6 3 _ a 31 x ir+1) - a 32 y (r+1) )ja 33 ) 

which takes the place of equations ( 8 ) of section 13.3.2. 

The procedure starts as before with an initial estimate x (0> , y (0> , and 
z (0> and then x (1> is found from the first of the equations (13), y (1> from 
the second and z U) from the third. It will be noted that this calculation 
for the second estimate differs from the previous method in that the 
current values of the variables at each stage of the iteration are used 
in proceeding to the next stage of the iteration. The Gauss-Jacobi 
iteration proceeds by whole steps whereas the Gauss-Seidel method 
proceeds by single steps. The advantage of the latter is that convergence 
is more rapid but there is the disadvantage that the calculations are 
not so simple to perform. 

Example Starting from the set of equations (10) with the first estimates 
x io) _ y(oy _ z (o> _ o, solve the equations (9). 

Solution The following table may be formed using equations (13), 


Iteration | 0 

l 

2 

3 

4 

5 

x 1 0 

2-250 

1-881 3 

1-997 24 

1-997 86 

1-999 83 

y o 

-0-300 

0-907 5 

0-972 94 

0-996 88 

0-999 37 

z 0 

2-650 

2-929 6 

2-990 06 

2-998 25 

2-999 73 


In five iterations the answer is close to the exact result of x = 2, 
y = 1 , and z = 3. The result of the third iteration is better than that 
of the fourth iteration using the Gauss-Jacobi method. 

Both the iterative methods are suitable for use with a high speed 
computer. 


13.4 NUMERICAL SOLUTION OF DIFFERENTIAL EQUATIONS 
The numerical solution of a differential equation of the nth order is 
usually obtained by solving an equivalent system of n first order 
differential equations. Consider for example, a general third order 
differential equation expressed in functional form 


(Ey 

dx 3 




dy d 2 y \ 
dx’ dx 2 ) 


(14) 
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Using the substitutions dy(dx = z; dzjdx = w means that the equation 
(14) may be replaced by the system of equations 

£-■«** o (15) 

d £=f( x ,y,Z, W )) 

The solution of (14) may then be considered as equivalent to solving 
the three simultaneous equations (15). It is necessary then to discuss 
the numerical solution of a general first order differential equation of 
the form ,/ v 

■£ =fix,y) 

13.4.1 Euler’s method of solution 

Consider the equation 

%. =f(.x,y) (16) 

together with its boundary conditions 

y=y 0 ;x = X 0 (17) 

Suppose the solution of (16) is represented by a function 
y = 4>(x), x = x 0 ,y= y 0 , 

then the solution would be represented graphically by the integral 
curve passing through the point ( x 0 , y 0 )- Such a curve is illustrated in 
Fig. 13.7. If the function <f>(x) can be found analytically then the 



Fig. 13.7 
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equation (16) is solved and there is no need for a numerical solution. 
In practice however the function <f>{x) may not be available so that the 
Fig. 13.7 is simply a theoretical solution curve. If ^(=^ 0 + h) is the 
abscissa of a point P on the solution curve near to (x 0 ,y 0 ) then the 
actual ordinate of P is 0 (jcj) or <f>(x 0 + h). 

The Euler method of solution is a simple computation that gives an 
approximation to the actual value <j>(x 0 -f h). Analytically the approxi¬ 
mation is to take the first two terms in the Taylor expansion of 
<f>(x 0 + ft) (see section 5.2), i.e. 

<K*o + h) — <f>(x 0 ) + h<f>'(x 0 ) = y 0 + h 

This value is then taken as the value y± corresponding to x lt i.e. 

yi=y 0 + hf(x 0 , y 0 ) (18) 

Graphically the process is to draw the tangent line to the integral 
curve at the point ( x 0 ,y 0 ), as in Fig. 13.7, and to construct the point 
P E where the tangent line intersects the ordinate at x x = x 0 + h. For 
this reason the Euler method is sometimes called the tangent method of 
approximation. The whole process may now be repeated starting from 
the value (x^yx) to compute the next approximation given by (x 2 , y 2 ) 



Fig. 13.8 
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where y 2 = Ji + hf{x 1 ,y 1 ), x l = x 0 + h, x 2 = x 0 + 2h (19) 

The successive points (x 0 , Jo)> ( x i>Ti)> (* 2 > J 2 ) are then approximations 
to the actual points on the solution curve through (x 0 , y 0 ) as in Fig. 
13.8. 

If y n is the approximation to the solution of (16) at the point 
x n = x 0 + nh starting from the point (xr 0 , j 0 ), then the next point is 
(*„ + i, yn+i) such that 

y n+ i = yn + hfixy n ) (20) 

Example (i) Find an approximation for /(0-3) if 
dyjdx = 2xjy 

and y(0) = 2. 

Solution 

Here 

f(x, y) = 2x/y and x 0 = 0, y 0 - 2 
h = 01, 

X! = 01 

yi=yo + hf(x 0 , jo) 

= 2 + (o-ixo) 

= 2 ; 

then x 2 = 0-2 

and J 2 = Ji + hfix lt yi) 

= 2 + (0TX0-2/2) 

= 201 ; 

then x 3 = 0-3 

and J 3 = Ja + hfix 2 , y 2 ) 

= 201 + (0-1X0-4/2-01) 

= 2 0299. 

Hence j(0-3) is approximately 2 03. 

The differential equation of Example (i) can be solved analytically by 
separation of variables (see 11.1.2) and the solution is j 2 /2 = x 2 + A. 
With j(0) = 2, then A = 2 and hence the particular solution is 
y 2 = 2(x 2 + 2). The actual value of j(0-3) is thus V 4 ‘l 8 or 2 045 to 
three decimal places. The solution of Example (i) may be written in 


Choosing 

then 

and 
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table form as follows 


n 

0 

1 

2 

3 

x n 

0 

01 

0-2 

0-3 

/» 

2 

20 

201 

2-03 

h 

01 

0-1 

01 


Ax n ,y n ) 

0 

01 

0199 


y n+ i 

20 

201 

2-0299 



Example (ii) Find an approximation to y( 0-6) if 



and y(Q) = 0. (Use an interval h = 0-2). 

Solution Setting the solution out in tabular form and working to three 
decimal places 


» 

0 

1 

2 

3 

x n 

0 

0-2 

0-4 

0-6 

yn 

0 

0-0 

0-04 

0-120 

h 

0-2 

0-2 

0-2 


V; 

II 

+ 1 

> 

0 

0-2 

0-402 


Jn+1 =y n + hf(x n ,y n ) 

0 

004 

0-120 



The result is j>(0-6) =e= 0-120. 

There is no simple analytical solution to the differential equation of 
Example (ii), but this is the differential equation of Example (iii) of 
section 11.7 and Fig. 11.7 indicates the solution using isoclines. When 
x — 0-6 on the relevant solution curve (a) which passes through (0, 0), 
y is approximately 0-2. On this curve, it is seen that y increases rapidly 
with x and it is to be expected that Euler’s method would seriously 
underestimate the true value of y(0-6). 

13.4.2 The Improved Euler method 

The Euler method of 13.4.1 is not very accurate, and the Fig. 13.8 
illustrates how the successive approximations (xlJj), (x 2 ,y 2 ), (x 3 ,y 3 ) 
can deviate from the correct positions on the integral curve through 
(x 0 , Jo). An improvement to the basic Euler method is the following: 
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(i) Use the basic Euler method as in Fig. 13.9 to estimate the value 
of y at (=x 0 + h). Call this point 

(ii) Use this latter result to estimate the value of the slope of the 

integral curve at the point Call this y±. 

(iii) Compute the average of the slopes of the integral curve at 

Oo, y 0 ), i.e. y 0 ', and the integral curve at i.e. y^. 

(iv) Use the basic Euler method at the point (x 0 , j 0 ) with the average 
slope value, i.e. KV + ji'), to estimate fo, 71 ). 

(v) Repeat the whole process to estimate the next point (x 2 , y 2 ). 

In the Fig. 13.9 the ordinate lengthy is given by 

y l = yo + W(xo,y 0 )+Xxi,yi p )\ 



The formula corresponding to (20) is 

Tn + 1 = Jn + \h[f{x nt y n ) +/(x n+1 ,/ B+1 ) ] ( 21 ) 

where y F n+1 = y n + hf(x n ,y n ) ( 22 ) 

and x n = x 0 + nh 

The superfix P, as in y^, is used to indicate the predicted value of the 
ordinate y x . This value is then corrected by using the average of the 
slopes. The technique is a simple case of the predictor-corrector method 
of finding an approximation. 

Example (i) Repeat Example (i) of 13.4.1 to find 7 ( 0 - 3 ). 
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Solution 

Here dy/dx =f{x,y) = 2 x/y and x 0 = 0, y 0 = 2. With h = 0-1, 
then x, = 01, yf = 2 + (0-l)(0) = 2 

and f(x u yi p ) = 2(0-1)/2 = 0-1. 

Hence y 1 = 2 + (0-l)(0 + 0-l)/2 

= 2005. 

Then x 2 = 0-2, y 2 p = 2005 + (0-1)(0-2/2-005) 

= 2-015 

and f(x 2 ,y 2 p ) = (0-4)/2-015 = 0-1985. 

Hence y 2 = 2-005 + (0-l)(0-0997 + 0-1985)/2 

= 2 - 020 . 

Then x 3 = 0-3, y 3 p = 2-020 + (0-1)(0-4/2-020) 

= 2-020 + 0-0198 
= 2-0398, 

and y 3 = 2-020 + (0-1X0-198 + 0-6/2-04)/2 

= 2-0446. 

Hence, j(0-3) 2-045 

This result is a/ 4-18 correct to 3 decimal places. 

Example (ii) Repeat Example (ii) of 13.4.1 to compute y(0-6). 

Solution 

Here dy/dx = f(x, y) = x + y 2 with x 0 = 0, y 0 = 0. The calcu¬ 
lation is set out in tabular form which is self-explanatory. 


n 

0 12 3 

x n 

0 0-2 0-4 0-6 

Tn 

0 0-02 0-080 0-183 

h 

0-2 0-2 0-2 

f(x n , J'n) = A n + y„ 2 

0-0 0-2 0-406 

Xji + 1 

0-2 0-4 0-6 

Tn + 1 =Jn + hf(x n ,y n ) 

0-0 0-06 0-161 

/(*n + 1 ,7n P +1 ) = + l + (J P +1 ) 2 

0-2 0-404 0-626 

Jn + 1 = Jn + W(*„, J„) +f(X n + 1 ,y p +1 )] 

0-02 0-080 0-183 
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The improved Euler method gives j(0-6) = 0-183 as the solution of 
dyjdx = x + y 2 , y( 0) = 0, a result which should be a better estimate 
than the value 0-120 of 13.4.1. 

Example (iii) Use the modified Euler method with h = 0-1 and work 
to two decimal places, to estimate j(|) given that 

£ = i +y. *o) = o. 

Solve the equation analytically and hence obtain an estimate to e. 
Solution 


Here f(x, y) 

= 1 +T 

and the 

result is tablulated 

[ below 


n 

0 

1 

2 

3 

4 

5 

x n 

0 

0-1 

0-2 

0-3 

0-4 

0-5 

yn 

0 

0-10 

0-22 

0-35 

0-49 

0-65 

h 

0-1 

0-1 

0-1 

0-1 

0-1 


f(x n , y n ) 

1-0 

1-10 

1-22 

1-35 

1-49 


Xn + 1 

0-1 

0-2 

0-3 

0-4 

0-5 


yU i 

0-1 

0-21 

0-34 

0-48 

0-64 


/fc-Xu) 

1-1 

1-21 

1-34 

1-48 

1-64 


Tn + l 

0-10 

0-22 

0-35 

0-49 

0-65 



Euler’s improved method gives the estimate 
T(l) = 0-65. 


Analytically, — =\+y 

or Jr 

and hence log (1 + j) = x + constant. 

The solution is 1 + y = Ae x , where A is a constant, and since 
j(0) = 0 it follows that A = 1, i.e. y = e 1 — 1. Using the calculated 
estimate 

e*=l +yQt) 

^ 1 + 0-65 
= 1-65 
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and hence, e ^ (1 -65) 2 ^ 2-72. 

This is the actual value of e, correct to two decimal places. 

13.5 ERROR AND PROPAGATION OF ERROR 
It has been assumed in the preceding sections that the supplied data 
are exact. For example the coefficients of equation (6) of 13.3.1 were 
assumed to be precisely those numbers. In any practical problem it is 
likely that such coefficients have been obtained as the result of an 
experiment, for example as the readings on a meter. The data are not 
exact but are approximations to some probably unknown values. Most 
engineering measurements however precise they may be are subject to 
some form of approximation and it is necessary to know how the use of 
the approximation, instead of the exact value, affects subsequent 
calculations performed with the data. 

Suppose the exact (but possibly unknown) value of some quantity is N 
and the approximation to this value is n, then the difference between 
n and N is called the absolute error in n, i.e. if this error is denoted by 


13.5.1 Errors 

Errors may be introduced in many different ways; 

(i) Human error is obvious and is particularly common, 

(ii) Round-off error is due to the need to express a number in 
decimal form and with a convenient number of decimal places. 
It is often introduced into a calculation when an exact fraction 
is written in decimal form. For example, the numbers 0-3, 0-33, 
0-333 are successive approximations to the exact number 1/3, 

(iii) Errors may be introduced due to the nature of the method of 
computation used, 

(iv) There may be errors inherent in the data being used, due perhaps 
to experimental measurement. 

The errors due to (i) may be reduced by checking and cross-checking 
and those due to (iv) may need a full statistical analysis. The chief 
concern here is with errors due to (ii). 

13.5.2 Round-off errors 

The number 130/63 can be expressed as a non-terminating decimal 
2 063492063. ... In order to use such a number in a computation it 
will usually be written as 2-06, 2-0635, 2-063492 with successively more 
accurate approximations. The approximations are the result of rounding- 
off to 2, 4, or 6 decimal places. Sometimes the term significant figures 
may be used instead. The three approximations above would then have 

3, 5, and 7 significant figures respectively. 

Example (i) The numbers 12-32,0-0134,125-0 are said to have 4, 3, and 

4, significant figures respectively. 
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It is useful to represent numbers in the form M x 10 m , where 
1 < M < 10 and m is an integer, to indicate the number of significant 
figures. Thus in the above example the numbers are respectively 
1-232 x 10 1 ; 1-34 x 10" 2 ; and 1-250 x 10 2 . All the numbers retained 
in M are significant. Note that the zeros of the second number are not 
significant whereas the zero of the third number is significant. In the 
former case the zeros merely locate the decimal point but in the latter 
case the zero serves to distinguish the number from any other number 
such as 125-4 or 125-9. 

Example (ii) Express 438-968 correct to 2 and 1 decimal places. Also 
express the number correct to 4, 3, 2, and 1 significant figures. 

Solution 

The number is 438-97 to 2 decimal places 
and 439-0 to 1 decimal place. 

The number is 4-390 X 10 2 to 4 significant figures, 
and 4-40 x 10 2 to 3 significant figures, 

and 4-4 x 10 2 to 2 significant figures, 

and 4 x 10 2 to 1 significant figure. 

The round-off rule to n significant figures is to retain n significant 
digits and discard the remainder. If the discarded number is 

(i) less than half a unit in the nth place leave the nth digit unchanged, 

(ii) greater than half a unit in the nth place increase the nth digit by 1, 

(iii) exactly half a unit in the nth place, add either 0 or 1 to the nth 
digit to make it an even number. 

Example (iii) The number 13-1535 is 13-154 to 5 significant figures; 
13-15 to 4 significant figures; and 13-2 to three significant figures. 
Example (iv) Express 14-2545 to 5, 4, and 3 significant figures. 

Solution Rounded-off to 5 significant figures the number is 14-254, the 
discarded remainder being exactly | unit in the 5th place. 

Rounded-off to 4 significant figures the number is 14-25, the discarded 
remainder 0-0045 being less than \ unit in the 4th place. 

To 3 significant figures the number is 14-3, the discarded number 
0-0545 being greater than | unit in the 3rd place. 

13.5.3 Absolute, relative, and percentage error 

The absolute error has already been defined in equation (23) as 
e = n — N .If only the magnitude of the error is important then 

H = |n-A| (24) 

is called the absolute error modulus. 

The absolute error is not necessarily a good indicator of the error. 
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For example, two measurements may be one metre and one kilometre, 
each with an absolute error of one centimetre but it is obvious that the 
former error is more significant than the latter. A better indicator in this 
case is the relative error r given by 

r = e/N (25) 

In the above illustration the relative errors are 
r = 1/100 = 001 (for N = 1 metre) 
and r = 1/100000 = 0 00001 (for N = 1 kilometre) 

In practice the number N may not be known but only its approxi¬ 
mation n, in this case the equation (25) is written 

r = e/N 

= «/(« ~ «) = («/«)(! - e/n)~ l 
= (e/n)( 1 + e/n + e 2 ln 2 -(- . . .) 

Thus a good indicator of the relative error is 

/• — £/« (26) 

The percentage error is simply lOOr. The percentage error for an absol¬ 
ute error of 1 centimetre in one metre is 1 % and for one centimetre in 
one kilometre is 0 001 %. 

13.5.4 Accuracy 

This is not simply the absence of error. For example a clock may be 
said to be accurate if it does not gain or lose more than, say, 3 seconds 
in 12 hours. In any 12-hour interval the absolute error e of the clock 
is then such that —3 < e < +3, i.e. e lies within a band of values 
(—3, +3). This band is called the error interval. If the clock was 
correct at midnight then the accurate time T hours, when the clock-time 
is T(< 12 hours) would be in the range 

T = t ± 3/3600 or t - 3/3600 < T < t+ 3/3600. 

Example Suppose the number 2-67 is the result of a rounding-off to 
two decimal places. The absolute error bound is 0 005, i.e. the correct 
number N lies within the band of values 

(2-67 - 0 005) to (2-67 + 0 005) 
i.e. 2-665 < N < 2-675 

13.5.5 Propagation of errors 

The basic algebraical operations are those of addition and subtraction, 
multiplication and division and these lead to propagation of errors. 
Errors may also be propagated by the use of other functional relation¬ 
ships, indices, powers, roots, etc. 
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(a) Addition and subtraction 

Suppose Ni and N 2 are exact, then N 3 where 

n 3 = n 1 ± n 2 

is also exact. If n x and « 2 are approximations to N x and N 2 with absolute 
errors e x = n x — N x and e 2 = n 2 — N 2 then n 3 is the approximation 
to N 3 with an error e 3 given by 

c 3 = n 3 — N 3 

= (mi ± n 2 ) — (Nx ± N 2 ) 

= (nx - Nx) ± (n 2 - N 2 ) 

i.e. e 3 = ex± e 2 (28) 

Thus, in an addition (subtraction) the absolute errors are added 
(subtracted). 

In many cases it is sufficient to use the absolute error moduli |c a |, 
|e a |, and |e 3 | and the result 

KKN + 14 (29) 

The result (29) simply states that the worst possible error will occur 
when the errors are additive. For example, in a subtraction the errors 
are additive when they are of opposite signs. 

Example (i) The two numbers 59-82 and 0-56 have been obtained by a 
rounding-off process. Estimate the error in the addition 

59-82 + 0-56 = 60-38 

Solution The maximum error modulus in 59-82 is |ci| = 0-005 and the 
maximum error modulus in 0-56 is |e 2 | = 0-005 also. If the error 
modulus in the addition is |e 3 |, then 

|e 8 | < 0-005 + 0-005 = 0-01. 

The result of the addition is then 60-38 ± 0-01, i.e. the second decimal 
place is in doubt. A more meaningful answer would be 60-4 correct to 
3 significant figures. The relative error is 0-01/60-38, i.e. 0-0002. 

Example (ii) The numbers 70-03 and 69-25 have been rounded-off to 
2 decimal places. Express the difference of the two numbers to a 
meaningful number of significant figures. 

Solution The difference 0-78 has an error e 3 and 
|«a| < 0-005 + 0 005 = 0-01 

The last digit is again in doubt and the difference lies in the band of 
values0-77 to 0-79. Consequently the result is 0-8 correct to 1 significant 
figure. Notice that the original numbers are of comparable size and 
each has 4 significant figures but the difference has only 1 significant 
figure. There can be a large loss of significant figures in a subtraction 
and this can be a source of serious error in an involved calculation. 
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(b) Multiplication and division 

Suppose that n 3 = n x n 2 is an approximation to the exact result 
N 3 = N 1 N 2 and that 

£i = «i — N x 

e 2 = n 2 — N 2 
e 3 = n 3 - N 3 

It follows that 

s 3 = n 3 - N 3 


= «1«2 - («i - £i)(« 2 — s 2 ) 

= «in 2 — («in 2 — «i«2 — « 2 «i + £i£ 2 ) 

= e 1 n 2 + £ 2 «i — £\E 2 

If, as is frequently the case, is small compared to the other term 
then 

«3 — £i« 2 + e 2 «i (30) 

is a measure of the absolute error involved. Dividing both sides of (30) 
by n 3 {= «x« 2 ) then an estimate of the relative error involved in the 
multiplication is given by 

£3 = 2 .£l^2 + £a«i 

« 3 «i « 2 " r rtl « 2 

i-e. r 3 ^ ri + r 2 (31) 

Similarly, in a division, suppose that n 3 = n^ii 2 is an approximation to 
N 3 = N 1 jN 2 then 

e 3 = «3 — N 3 

= «!/« 2 - Ai/fla 
= «i/«a ~ («i - £i )/(«2 ~ £ 2 ) 

= «i /«2 - («i/« 2 )[(l - £i/«i)(l - £ 2 /n 2 )~ 1 ] 

= «i/» 2 - («i/« 2 )[l + e 2 /« 2 - ex/fix - e 1 e 2 //i 1 n 2 + . . .] 

= («a/»2)[ei/»i - e 2 /« 2 ] + . . . 

An estimate of the relative error in a division is given by 

*3 — »i - r 2 (32) 

Thus in a multiplication (division) an estimate of the error is obtained 
by adding (subtracting) the relative errors. 

In many numerical calculations it may not be known whether the 
error is positive or negative and the only meaningful error measure is the 
absolute error modulus |e| or the relative error modulus |r|. The two 
estimates (31) and (32) are frequently replaced by their cruder, but 
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N < kil + k.l- (33) 

Note that |r x + r 2 \ < \r t \ + |r 2 | and |r a — r 2 | < |nl + \r 2 \ (see also 
Section 10.3.4). 

Example (iii) The numbers 1-3 and 2-4 are the result of a rounding-olf 
process. Give the product 1-3 X 2-4 to a meaningful number of signi¬ 
ficant figures. 

Solution If the numbers are assumed accurate then 1-3 x 2-4 = 3-12. 
Due to the rounding-off process each number has an absolute error 
modulus of 0 05. The relative error moduli are 

kil = TT = 0038 and ^ = = 0021 

respectively. From (33) an estimate of the relative error modulus of the 
pf0< ' U,!tiS |r.|<W + |r.|-0O59 

The absolute error modulus is then estimated as 
|«a| <0 059 X 3 12 = 0 1841. 

The product N = 1-3 x 2-4 has an estimated value given by 
3-12 — 0-18 < TV < 3*12 + 0-18 
i.e. 2-94 < N < 3-30 

The result of the multiplication to a meaningful number of significant 
figures is thus N = 3. 

Instead of working in terms of the relative error (with the simple rule 
of addition (subtraction) in the case of multiplication (division) it is 
often convenient to work with the absolute error. 

Example (iv) Estimate 3-2/0-411 if the numbers are the result of round- 
ing-off. 

Solution The quotient is N and 
3-2 ± 0-05 
^ ~ 0-411 ± 0 0005 

(3-2 ± 0-05)(l ± 0-0012)" 1 
- 0-411 

= (3-2 ± 0-05)(l T 0-0012 . . ,)/0-411 
or N — [3-2 ± (0-05 + 3-2 X 0-0012)]/0-411 (for maximum possible 
error). 

Hence, N — 7-786 ± 0-131 and N has a value in the range 7-66 to 
7-92. Thus it is possible to claim that N has the value 8 correct to 1 
significant figure. 
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(c) Functional relationships 

Suppose that n 2 = f(n,) is an approximation to the exact result 
N 2 =f(N 1 ) where f(x) is some functional relationship of x. If n, has 
an absolute error and n 2 an absolute error e 2 then 

e 2 = n 2 - N 2 =f(n 1 ) — f(N,) 

= /(»i) -fin, - e0 

= /(«1) ~ If (hi) - £i/'(«i) + £i 2 />i) • ■ •] 

assuming the Taylor expansion of section 5.2. Thus 

e 2 ej'in,) (34) 

Example (v) If 1 -377 is the result of a rounding-off to 4 significant 
figures, estimate the absolute error modulus of log 1-377 and hence give 
the result to an appropiate number of figures. 

Solution From (34) 

tel = 0-0005 

and /'(«i) = 1/1-377. 

The estimate for |e 2 | is then 

|e 2 | — (0-0005)(l/l-377) 

—0-0004. 

Using 4 figure tables 

log 1-377 = 0-3200 ± 0-0004. 

The result is 0-320 correct to 3 significant figures. 

13 6 ERRORS IN ITERATIVE PROCESSES 

An iterative process for calculating a root of f(x) = 0 is often written 
in the form 

x n+1 = F(x n ). (35) 

For example, in 13.2.2 F(x n ) is <j>(x n ) where 
x - <f>(x) =/(x) 

while, in 13.2.3, F(x n ) is the expression given by (4), i.e. 

F(x n ) = x n -f(x n )/f'(x n ). 

Suppose the absolute error in the iterate x n is e n then, if X is the correct 
value, (35) leads to 

^+ £n+ 1 =F(A+ £n ) 

- F(X) + e n F'(X) + j r e n 2 F"(X) + . . . 
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which, with X = F(X) reduces to 

£ n +i — E n F'(X) + W F"(X). 

For the iterative method of 13.2.2, F(X) = <f>(X) 
i.e. F'(X)=i\X) 

and c n +i — e n<f>'(.X) + |£„ a fW 

i.e. £ n+1 ^£ n f(Z). (36) 

From (36) it may be noted that if |^'(^)| <1 then 

l«. + il < w 

and the error decreases as the process proceeds i.e. the process is 
convergent. 

For the iterative method of 13.2.3 
F(X) = X-f(X)lf'(X) 

and hence 

F\X) = 1 -f\X)lf\X) +f(X)f\X)l{f'(X)} 2 

=f{X)f\x)i{f\xw 

= 0 


since f(X) = 0. 

It follows that 

F\X) =nx)f\x)!{f\x)Y 
=f"(X)lf\X) 

and so e n + 1 ^As n 2 (37) 

where A = f"(X)l2f(X). 

From (37) it may be noted that the error in the (n + l)th iterate is 
dependent on the square of the error in the nth iterate. This is one 
reason why the Newton-Raphson method of 13.2.3. is usually a 
rapidly convergent process. However the form of A also shows that if 
f"(X)lf(X) is too large (i.e. if f’(X) is too small and/or f"{X) is too 
large) then the process could diverge. The equation (37) shows that if 
Ae n > 1, i.e. £ n+ i > £„ then this divergent process is possible. For 
example if A = 15 and £ 0 = 01 then (37) gives £ x 0T5, e 2 === 0-33, 
and £ 3 — 1-5 and the process diverges. Such a divergent process is 
illustrated in Fig. 13.6(c) of section 13.2.3. It may be possible to prevent 
this divergency by starting from a better approximation. For example, 
if A = 15 but £ 0 = 0 05 then £j — 0 04, s 2 — 0 02, and e 3 — 0-006 and 
the process converges. 
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EXERCISE 13 

1 Show that the equation x + 2 sin x = 1 has a root between 0 and 1 
and use the rule of false position to find it to two decimal places. 

2 Show graphically that 2 cos 2x = e 1 has only one positive root 
but an infinity of negative roots. Use the rule of false position to 
find, to two places of decimals, the two roots of numerically least 
value. 

3 Find the root of Question 1, correct to four places of decimals, 
by the Newton-Raphson method. 

4 Show that 4x 3 + 3x 2 — 6x — 1 = e _I has two negative roots and 
one positive root, and use the Newton-Raphson method to find 
the negative root between — 1 and 0 to three decimal places. 

5 Evaluate y/7 to four significant figures by the Newton-Raphson 
method. 

6 Show that if x r is an approximation to tya then a better approxima¬ 
tion is given by 

x r+l = M2* r + a/x 2 ]. 

Evaluate ^10 correct to four significant figures. 

7 Use the iteration procedure, x r+1 = 4>{x r ), to find the positive 
root of the equation 6x = log (x + 2) correct to four decimal 
places. 

8 Find, to four decimal places, the smallest positive root of 
x 3 — 3x + 1 = 0 using (i) the iteration procedure of section 13.2.2, 
and (ii) the Newton-Raphson method. Find also the negative root 
of the equation. 

9 Use Gaussian elimination to solve the following equations 

1-23* + 0-86y + 0-39z = 3-50 

0 06x - 2-01y - 0-26z = 2-62 

0-39* - 0-24/ + l-53z = —1-98 

Use a check column and calculate the first estimates to three 
places of decimals. Calculate the residuals and form the second 
estimates. 

10 Solve the equations of Question 9 using (i) the Gauss-Jacobi 
method, and (ii) the Gauss-Seidel method. 

11 Obtain x u x 2 , x 3 , and x 4 correct to three places of decimals by the 
Gauss-Seidel method, given 

20*! — x 2 + x 3 — x t = 8 

Xi — 40x 2 + 4x 3 + x 4 = 4 

—+ x 2 + 10x 3 = 6 

x 2 — x 3 + 8x 4 = —5. 
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12 Use Euler’s method with step length 0-2 to solve 

dyldx + y 2 = 0, j(0) = 1 

for values of x up to x = 1 (work to two decimal places). Integrate 
the equation and compare the approximate value of y(l) with the 
actual value. 

13 Solve numerically the differential equation 

dyjdx = xy + y 2 

with initial value y( 0) = 1 using the improved Euler method. Use a 
step size of 01 for x up to 0-4. Work to three decimal places and 
give the answer to two decimal places. 

14 Solve numerically the differential equation 

dyfdx = x — y 2 

with initial value/(0) = 1, using the improved Euler method with 
step size 0-2 for x up to 1-0 and calculate to three decimal places. 
From a graph of the results find approximately the minimum value 
of/ and the value of x at which / reaches this minimum. 

15 If all the numbers are rounded-off find the maximum possible 
error of the following calculations and give the answer to an 
appropriate number of decimal places 

(2 00 X 4 00) + 3-62 - 9-82 

16 If x = 0-56 is the result of rounding-off give the value of e“ 3 * to an 
appropriate number of decimal places (tables give e" 1 - 68 as 0-1864). 
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Numerical Methods II 


14.1 INTERPOLATION 

The reader is probably familiar with the idea of interpolation as 
applied to a table of values of a variable. For example, suppose that 
the function f(x) is tabulated at intervals of 0-1 from x = 10 to x = 2 0 
and that it is required to find /(1-05). The function f(x) may be a 
known function, but complicated, or it may be an unknown function 
the tabulation having being obtained as the result of an experiment. 
Let /(1-0) = 2-718 28 and /(H) = 3-004 17 then an approximation to 
/(I-05) can be obtained by assuming that the graph of/(x) in the range 
1-0 < x < 1-1 is a straight line so that/(1-05) = 2-718 28 + 1(3-004 17 
- 2-718 28) = 2-861 22. 

The general process of finding a value of the function at other 
than the tabulated values is called interpolation , the simple process 
above is called linear interpolation and is unlikely, in the example 
given, to be accurate beyond the second decimal place. 

A better approximation to the required value is obtained by using 
all of the known tabulated values instead of just the two values on 
either side of the interpolated point. This can be done by replacing 
the function f(x) by a simpler function p(x) that takes the values of 
f(x) at all of the tabulated points and from which the value of /?(l-05) 
can be found by a simple computation. There are many types of replace¬ 
ment functions p(x), the simplest of which is a series of powers of x, 
i.e. a polynomial function. Other types of replacement functions that 
are in common use are series of trigonometric functions, or series of 
exponential functions, or series of more complicated (but well known) 
mathematical functions. 

It is in this wider sense, that is in the finding of the best type of 
replacement function, that the word interpolation is often used nowa¬ 
days. The polynomial functions can lead to accurate interpolation 
formulae and these are the only functions to be discussed here. 

14.2 DIFFERENCE TABLE 

Suppose y 0 , y u y 2 , ■ • ., y n is a set of values of a (known or unknown) 
function /(x) corresponding to a set of values x 0 , x t , x 2 , . . x n 
written in ascending order of magnitude so that x 0 < Xj < . . . 
<*„_! < x n . The variable x is called the argument and the variable 
values y r are called the entries, the corresponding values being con¬ 
veniently displayed in table form. It is required to estimate the value 
of the entry that would be obtained for a value of the argument in the 
range .v 0 < x < x n . 
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Any reliable interpolation formula used to find this estimate will 
employ all the given information, that is it will be based on the values 
of the n + 1 entries. 

Instead of using the values directly it is found convenient to use 
relationships dependent on these values. The relationship y r+1 — y T is 
called the first difference of y, and is denoted by A y„ i.e. 

Aj r =/ r+ 1 -T,. ( 1 ) 

The relationship Ay r+1 — Ay r , being the difference of the first difference 
of y r , is called the second difference of y T and is denoted by A(A y T ) or 
A 2 y„ i.e. 

A 2 y r = A/ r+1 - A y r . (2) 

For example, 

A 2 y 0 = Ay 1 — Ay 0 

= (72 - yd - (yi - 7o) 

= 72 - 27 ! +y 0 , 
and similarly, A 2 /! = 73 — 2 y 2 + y x . 

In general terms 

A 2 y r = 7r+a - 2 y r+1 +y„ r = 0, 1, 2. n - 2. (3) 

In a similar manner the third differences may be formed so that 
A 3 y r = A 2 y I + i — A 2 / r 

== (7r + 3 - 27 ,+ 2 + 7r + i) ~ (7r + 2 ~ 2y t + 1 +y r ) 

= 7r +3 - 3 y r+a + 3y r+1 - y r ; r = 0,1,2,. . n - 3. (4) 

Clearly, successive differences may be calculated, each difference 
being dependent on a selection of the given entries y 0 , y t , y 2 , . . y n . 

x _y_Ay_Ay Ay Ay Ay 


Ay 0 



y\ 


Ay 0 




Ay, 


4 

x 2 

y z 


Ay, 




A y 2 


Ay, 


*3 


A 2 y 2 




Ay 3 


Ay 2 

x 4 

y 4 


A 2 y 3 



Ay 4 

Vs 


Table 14.1 
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The polynomial interpolation formula, dependent on the n + 1 
entries, may then be expressed in terms of these differences. For 
convenience the differences are set out in tabulated form, entries in 
successive difference columns being placed in the table mid-way 
between the two entries used in forming that difference as in Table 
14.1. 

Example Form the difference table for the function 
y = x 2 - 2 

for x = 0 to x = 6 at intervals of 1 unit. 

Solution The values of y are tabulated and successive differences found 
as below: 

x y Ay A 2 y A 3 y 
0 -2 

1 

1 -1 2 

3 0 

2 2 2 

5 0 

3 7 2 

7 0 

4 14 2 

9 0 

5 23 2 

11 

6 34 

In this example all the differences after the second are zero. It is no 
coincidence that the second differences of a second degree polynomial 
are the same. For, consider a polynomial of degree s, i.e. 

y = a s x* + a.-iX* -1 + . . . + apx + a 0 

and suppose a difference table is formed for equal intervals h of the 
argument x, then x r+1 — x T + h and 
Ay, =y r+ 1 —y. 


= a s (x s r + h + . . . — xf) + . . . 

= a^hx 3 ' 1 + terms in lower powers of x„ 

and 

A 2 y r = A y r+1 — Ay, 

= a s sh[(x, + h)*- 1 - xr 1 ] + - . • 

= a^s — l)h 2 x s ,~ 2 + terms in lower powers of x r . 
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Each successive difference is a polynomial of degree one less than its 
predecessor. Finally, 

&y r - a,(s'.)h’ 

= constant. (5) 

The equation (5) shows that the 5th difference of a polynomial of 
degree s is constant and it follows that all subsequent differences are 
zero. The converse may be used to deduce a polynomial relationship 
as in the following example (see also example 2 of Exercise 14). 
Example In the following difference table 

x y Av A 2 y A 3 y 

0 4 

19 

1 23 18 

37 6 

2 60 24 

61 6 

3 121 30 

91 6 

4 212 36 

127 6 

5 339 42 

6 508 169 

the third difference column is constant and there is an exact poly¬ 
nomial relationship between x and y of degree three in x. The observant 
reader will see that y = (x + 2) 3 — 4. 

It is possible for the values of the argument x to be given at unequal 
intervals and in this case the above tables are modified by employing 
divided differences. Only equal intervals of the argument are considered 
in this book. 

14.2.1 Errors in a difference table 

The first step in any interpolation procedure is the formation of a 
difference table from the given data and since this involves a large 
number of additions, subtractions, and entering of results it is a source 
of numerical error. A simple check can be obtained by adding the 
entries of any column of differences and comparing the result with the 
difference between the last and first entries of the previous column. 

For example, adding the entries of a second difference column 
A^o + A 2 y, + A *y 2 + . . . + A y 5 
= (Ayi - A yo) + (Ay 2 - A/0 + . . . + (A y e - Ay 5 ) 

= Aye - Ay 0 . 

An error is frequently made in the entry column y and care must be 
taken when transferring the data from its given state (e.g. as the 
result of an experiment) to a difference table. Suppose that the entry 
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v 3 of Table 14.1 is 

subject to an 

error e. 

then Table 14.2 shows how 

this < 

;rror is propagated throughout the table. 


The error in (say) the third difference column appears in four terms 

with 

error coefficients given by the binomial coefficients of (1 — s) 3 . 


x y 

Ay 


aV 

A’y 


x 0 y 0 

Ay 0 





*1 * 



A ho 




Ay, 



Ay 0 + e 





A 2 y, + e 




Ay 2 + 



Ay, “ 3e 


*3 ' * + 



iy z - 2 1 

A 5 y 2 + 3e 


*4 y4 



A 2 y s + « 



*s y* 

Ay 4 


tfyl'"""-' 

- - 



Ay s 





y« 







Table 14.2 



Example Find the errors in the following table. 


X 

y 

Ay 

A 2 y 

A 3 y 

Error in A 3 y 

0 

34-8 

38-6 




1 

734 

60-2 

21-6 

4-8 


2 

133-6 

86-6 

26-4 

4-8 


3 

220-2 

117-8 

31-2 

4-3 

= 4-8 - 0-5 

4 

3380 

153-3 

35-5 

6-3 

= 4-8 + 1-5 

5 

491-3 - 


- 41-8— 

- - - - 

- 



195-1 


3-3 

= 4-8 - 1-5 

6 

686-4 

240-2 

45-1 

5-3 

= 4-8 + 0-5 

7 

926-6 

290-6 

50-4 

3-8 


8 

1 217-2 

345-8 

55-2 



9 

1 563-0 







(1 528-2) 

(307-2) 

(32-6) 
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Solution Adding each difference column and checking against the 
difference of the first and last entries of the preceding column reveals 
an error in the last column, i.e. 55-2 — 21-6 = 33-6. Recalculation of 
A 3 j shows that the last entry should be 4-8 and not 3-8. The appearance 
of the A 3 y column suggests now that all the entries should be 4-8. 
The values 4-3, 6-3, 3-3, and 5-3 are 4-8 — 0-5,4-8 + 3(0-5), 4-8 — 3(0-5), 
and 4-8 + 0-5 respectively. Comparison with Table 14.2 shows that 
there is an error e = —0-5 in the entry 491-3. The corrected value 
is 491-8. It is left to the reader to complete the corrected table. 

14.3 THE NEWTON-GREGORY FORMULAE OF 
INTERPOLATION 

Lety_ n ,y_ n + 1 ,. . ., y_ lt y 0 ,y u . . .,j„_i, 7 n be values of the entries 
corresponding to equally spaced values x_ n , x_ B+1 , . . ., x_ u x 0 , 
*i, . • ■, x n -i, x„ in the argument. The unique polynomial function 
y = p(x) satisfying any n + 1 successive values is of degree n and 
could have the form 

p{x) = A 0 + A x x + A 2 x 2 + . . . + A n x n (6) 

the n + 1 unknowns A { being found from the n + 1 linear equations 
obtained by substituting the corresponding x and y values in function 


14.3.1 The forward interpolation formula 

Suppose the n + 1 successive values are (x u y t ), i = 0,1,2,. . ., n 
then it is more convenient to write function (6) in the form 

p(x) = a 0 + (x- x 0 ) ai + (x- x 0 )(x - Xl )a 2 + . . . 

■•• + (*- *o)(x - Xi). . . (x - x n _i)a n (7) 

and to use the n + 1 values to calculate the a { . Let the equal intervals 
between successive arguments be h, then 

x r — x s = (r — s)h. (8) 

The unknowns are then determined from the n + 1 equations 
>V = a 0 + rha x + r(r - l)h 2 a 2 + r(r ~ l)(r - 2)h 3 a 3 + ... 

+ r(r - l)(r - 2) . . . (r - n + 1 )h n a n 

= a 0 + 2 r(r - l)(r - 2) . . . (r - t + l)h l a t (9) 

with r = 0, 1, 2, . . ., n. 

Forming successive differences leads to 
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A 2 y r = A y r+1 — Ay r 
= 2 h 2 a 2 

+ t W - ~ 1)(T - 2) . . . (r - t + 3 )]h l a t (11) 

A 3 y r = A 2 y r +i — A 2 y r 
= 3.2A 3 a 3 

+ i K* - 1)(' - 2)] Hr - l)(r - 2) 

. . . (r - t + 4)]A f a ( (12) 


A n_1 j/ r = (n - l)!/j n_1 a n _ l + [«(« - 1)(« - 2) . . .2][r]A“a. 
A n j; r = n\h n a n 

From equations (9) to (14) it follows that 

a 0 = Jo! = Ajo/A; «2 = A 2 y 0 /2\h 2 ; . . 
a n = A n y 0 /ti\h n 
and hence in equation (7) 

, , ,(*-*<>)*. . (x - x 0 )(x - x x ) A 2 y 0 , 

P(x) =y 0 + -^- Ay 0 + --2T + ‘ ' 


(13) 

(14) 

05) 


{x - x 0 )(x - x x ) . . . (x - x n _ 0 A"y 0 
+ h n «! 

in which x is any value in the range x 0 < x < x n . 

If the argument is written 

x = x 0 + uh 

where 0 < u < n, then, with the use of (8), equation (16) may be 
expressed in the form 
y = /(x) =/(x 0 + uh) —p(x 0 + uh) 

A 2 y n 

= y 0 + uAy 0 + u(u - 1) -jf- + . . . 

+ u(u - 1X« - 2) ...(«-»+ 1) ^f° (18) 


(16) 

(17) 


where p(x) is the polynomial approximating the actual (unknown) 
function /(x). An examination of Table 14.1 and equation (18) shows 
that the ‘path’ of this interpolation proceeds through the table from y 0 
along the diagonal of the leading differences of y 0 , i.e. the formula uses 
those values forward (in the table) fromj 0 . The formula (18) is frequently 
called the forward Newton-Gregory interpolation formula and is used 
mainly for interpolation near the beginning of the table where u is 
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small. The coefficients of equation (18) are binomial coefficients and the 
formula may be written 

y-y„ + 2 (“) A> 0 (19) 

where (“) is - ' X “ - - - <“ - r + & 


Example (i) In the following table use the result (18) to find (a) ft 2-4), 
(b)/(8-7). 


JC I 2 

4 

6 

8 10 

fix) 1 9-68 

10-96 

12-32 

13-76 15-28 

Solution Form a difference table and note that all differences beyond 

the second are zero. 




X 

s 

II 

A y 

A 2 j 

2 

9-68 

1*28 


4 

10-96 

1-36 

0-08 

6 

12-32 

1-44 

0-08 

8 

13-76 

1-52 

0-08 

10 

15-28 



(a) From (17) with x 
equation (18) gives 

= 2-4, * 0 

= 2, 

h = 2, then u = 0-2 and 

f(2-4) = 9-68 + 0-2 X 1-28 + (0-2)(- 

V™> + 0 + 0 ... 


= 9-929 6 


(b) From (17) with x = 8-7, x 0 = 2, then u = 3-35 and equation 
(18) gives 

/(8-7) ^ 9-68 + (3-35)(l-28) + (3-35)(2-35) 

= 14-282 9. 

Example (ii) In the following table of e x use the formula (18) to calcu¬ 
late (a) e 0 ' 12 , and (b) e 2 ' 00 . 

x I 0-J 0-6 1-1 1-6 2-1 

e 1 I 1 105 2 1-822 1 3-004 2 4-953 0 8-166 2 

Solution Form a difference table and note that in this case there is no 
difference column that is constant (this is to be expected since e* cannot 
be represented by a polynomial function). 
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x y = e 1 A/ A 2 y A 3 _y A 4 _y 

0-1 1-105 2 

0-6 1-822 1 

1-1 3-004 2 

1- 6 4-953 0 

2- 1 8-166 2 

(a) with x = 0-12, x 0 = 0-10, h = 0-5, then u = 0-04 and (18) gives 
e°- 12 = 2 = 1-105 2 + (0-04)(0-716 9) + (0 04)(-0-96) ( °'^ - 2) 

+ (0-04)(—0-96)(—1-96) (0 ‘ 3 ^ 5) 

+ (0-04)(—0-96)(—1-96)(—2-96) ((H ^ 6 2) 

= 1-126 9. 

The correct value to 5 decimal places is known to be 1-127 50. 

(b) with x = 2-00, x 0 = 0-10, h = 0-5, then u = 3-8 and equation (18) 
gives 

e 2 -oo ^ 1-105 2 + 2-724 22 + 2-474 86 + 0-962 39 + 0-125 25 
= 7-391 9 (to four decimal places). 

The correct value to four decimal places is known to be 7-389 1. 

In example (i) the Newton-Gregory formula (18) for p(x) is identical 
with the function /(x), which is a quadratic function, and the results 
for/(2-4) and/(8-7) will therefore be correct to the number of decimal 
places retained. In example (ii) the function e 1 is replaced by a fourth 
degree polynomial which takes the value of e 1 at the five given entries. 
Because the successive differences decrease, the higher differences are 
relatively small so that any term in the formula has a smaller contribu¬ 
tion than its predecessor. For the range of values given (i.e. x = 0-1 
to x = 2-1) the quartic polynomial is a reasonable approximation to 
the actual function e*. The error involved in the estimation for e° 12 is 
about 0-05 per cent and in the estimation for e 2 ' 00 is about 0-04 per cent. 

Example (iii) Use the formula (18) to estimate the value of/(2-5) in 
the following table of/(x). 

x I 2 3 4 5 6 

/(x) I 1 2 6 24 120 


0-716 9 

0-465 2 

1-182 1 0-301 5 

0-766 7 0-196 2 

1-948 8 0-497 7 

1-264 4 

3-213 2 
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Solution Form the difference table from the given data 

x f{x) Ay A 2 y A 3 y A *y 

2 I 


3 

4 

5 


2 

6 

24 


3 

4 11 

14 

18 64 

78 
96 


53 


6 120 

With x = 2-5, x 0 = 2, h = 1, then « = 0-5 and formula (18) gives 
/(2-5) — 1 + 0-5 - 0-375 + 0-687 5 - 2-070 3 


= -0-257 8. 


The function /(x) in this example is well-known and is called the 
Gamma function or the generalized factorial function, fix) = (x - 1)!. 
The value of/(2-5) is known to be 1-329 3. The estimated value is 
obviously incorrect, the reason being that the successive differences are 
increasing rapidly and the series obtained from formula (18) is a 
divergent one. The values of fix) are increasing so rapidly that it is 
impossible for a fourth degree polynomial to approximate to the 
function even though it is exact at the given five values. In other words, 
the information provided (i.e. the given data) is too meagre to describe 
the behaviour of the function. 

Although the exponential function of example (ii) is also a rapidly 
increasing function the data are given with an interval of x small enough 
to be able to interpolate with reasonable accuracy within the range of 
the table. If here, the interval of the argument x had been too large 
(say h — 1) the interpolation formula would not have provided an 
accurate estimation except possibly at values extremely close to the 
given entries. The difference table below shows how rapidly the differ¬ 
ences are increasing for h = 1. 

x e x Ay A 2 y A 3 _y A i y 

o i ~ 


1 

2 

3 

4 


2-718 3 
7-389 1 
20-085 5 
54-598 2 


1-718 3 
4-670 8 
12-696 4 
34-512 7 


2-952 5 
8-025 6 
21-816 3 


5-073 1 
13-790 7 


8-717 6 


Another form of the interpolation formula may be used when inter¬ 
polation is required at a value near the end of the table. 
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14.3.2 The Newton-Gregory formula for backward interpolation 

Suppose that (x 0 , j> 0 ) is at the end of the table then it is required to 
find a polynomial that satisfies the (n + 1) values (x_ j_ { ), i = 0,1,2,3, 
. . ., n. Proceeding as in section 14.3.1 it is convenient to write function 
(6) in the form 

p(x) = b 0 + (x — x 0 )b 1 + (x - x 0 )(x - x^ 1 )b 2 + . . . 

+ (x - x 0 )(x - x_j).. .(x - x- (n - v )b n (20) 

with 

x_ r - X— s = —(r - s)h, (21) 

from which it follows that 

y- r = b 0 + i (-l)V(r - l)(r - 2) . . . (r - t + 1 )h%. (22) 

Taking successive differences of equation (22) and using the notation 
of Table 14.3 leads to 

b 0 = y 0 \ hbj. = Ay_ x ; 2 \h*b 2 = A 2 7 _ 2 ; . . .; n\h n b n = A"y_„. (23) 

x y_ Ay Ay Ay A^y 


Ay.„ 


a 3 /-3 


Substituting into equation (20) and using 
x = x 0 4- uh 

gives the backward interpolation formula in the form 
y = /(*) = /('Vo + ub) — p(x 0 + uh) 


( 24 ) 







which may be written 
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J'-ft + J. (“ + ' r (26) 

where (») = ^-lX»-2), j .>-r+l) 

Table 14.4 shows the paths of the forward and backward interpolation 
formulae through the difference table for n = 5. 
a:_s y-s 


X4 


*5 



- Newton - Gregory (orword 

—*—»— Newton - Gregory boekward 


Table 14.4 


Example Apply the backward interpolation formula (26) to find e 2 ' 00 
in example (ii), section 14.3.1. 


Solution Using the difference table of that example and substituting 
into equation (26) with x = 2 00, x 0 = 2T0, h = 0-5, so that u = —0-2, 
leads to 


- 8-166 2 — (0-2)(3-213 2) 
(1-264 4) 


- (0-2)(0-8)(l-8)(2-8) 


6 

(0 196 2) 


= 7-392 0. 
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14.3.3 Interpolation paths 

It has been observed (see Table 14.4) that the two Newton-Gregory 
formulae use the in formation of the difference table obtained by pro¬ 
ceeding through the table along different paths. The forward formula 
is useful when interpolating near the beginning of the table while the 
backward formula is useful when interpolating near the end of the table. 
Both formulae may be used with a reasonable expectation of accuracy 
providing that the interval h in the given data is not too large and 
providing always that the higher differences decrease in value and 
become eventually negligibly small. 

Interpolation is frequently required near the centre of the table and 
the best interpolation formula in such a case will be obtained by pro¬ 
ceeding through the difference table by a path that is as near as possible 
to the horizontal line through the interpolated value. There are a number 
of different paths starting from (say) the value y 0 in the Table 14.4. 
Each path will provide a different form of the interpolating polynomial, 
the various coefficients being obtained from the differences encountered 
along and on either side of the path. 

Historically, each different formula (but all variants of the original 
polynomial (6)) was obtained separately and bears the name of its 
discoverer, e.g. Gauss, Bessel, Stirling. However, these formulae, and 
indeed any other formula, can be obtained from a simple diagram 
called the lozenge diagram together with four basic rules of derivation. 
The form of the diagram and the basic rules will be deduced from the 
two Newton formulae in the next section and then used to derive the 
other named formulae. 

The next section may be omitted at a first or casual reading and the 
reader may, without immediate loss, move to section 14.5 where these 
formulae are derived and applied to interpolation problems. 


14.4 THE LOZENGE DIAGRAM 

The Table 14.5 shows the lozenge diagram which is, in essence, the 
difference table about the point y 0 together with the binomial coefficients 
of the Newton-Gregory formulae displayed according to a definite 

pattern. The coefficient f is placed immediately above the 
difference A T y,. It may be noted that = 1 for all s. 

The two Newton formulae 

y = f(x 0 + uh ) ^ p(x 0 + uh) 

= y° + (“)Ay„ + (“)+ (“)A 3 y° + . . . +(“)a’>0 + • • ■ 

(27) 
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and 

y = fix o + uh) — p(x Q + uh ) 

= 7 ° + Q Ay_ x + (“ 2 + (“ 3 2 ) A3 J-a + • ■ • 

+ (" + ^ _1 ) A ->_ r . . . (28) 

may be obtained from Table 14.5 by proceeding along the appropriate 
diagonal paths and applying the following two rules: 

Rule 1 If, on arrival at a difference entry, the slope of the step was 
negative then form the product of that difference and the binomial 
coefficient immediately above it; but, if the slope was positive then form 
the product of the difference and the binomial coefficient immediately 
below it. 

Rule 2 Add the successive products formed on proceeding through the 
table from left to right. 



Table 14.5 


Example (i) Starting from y 0 and proceeding via the diagonal path of 
negative slope then 

7=^0+ (“) A 7o + (“) A 2 _y 0 + A 3 / 0 + . . . 

which is equation (27). 
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Example (ii) Starting from y 0 and proceeding via the diagonal of 
positive slope 

y = Ay-! + (“ 2 l ) A*y . 2 + (“ 3 2 ) A ^-3 + • • • 

which is equation (28). 

Example (iii) The backward Newton-Gregory formula starting at y x 
and using Rules 1 and 2 is 

y = yi + (“ 7 *) A* + (2) A2 j-i + (“ 3 *) A3 t-s + • • ■ • 

Suppose now that a zig-zag path passing through y 0 , Ay 0 , A 2 y„ lt 
A 3 y_ j, A 2 , is considered, then the two rules would give the formula 

y = To + (j) A Jo + (2) A2 J-1 + (“ * *) A3 - y - 1 

+ (" t ! ) + • • • < 29 > 

This is a valid interpolation formula known as the Gauss forward 
formula. 

Similarly taking a zig-zag path from y 0 starting in the opposite 
direction leads to 

y = y° + (“) A r-i + (“ t') + (” 3 ') 

+ ' ' (30) 

which is a result known as the Gauss backward formula of interpolation. 

It is evident that Rules 1 and 2 are sufficient to provide certain inter¬ 
polation formulae but before it can be stated that any path gives a valid 
interpolation formula it is necessary to prove first that any two paths 
through the table with the same start and finish are identical and second 
that if any two paths, entering the table at different points, end at the 
same point then the derived formulae are identical. For example, in 
Table 14.5 a path that starts at y 0 and ends at A e y- 3 will give a formula 
that is a sixth degree polynomial satisfying the seven values (x_ 3 , 7 - 3 ), 
. . ., (x 3 ,j 3 ) and the second requirement would show that the same 
polynomial should be found by any other path starting at (say) y 2 and 
ending at A 6 j_ 3 . To prove the first requirement it is necessary only to 
consider the paths taken round one complete lozenge. Consider the 
paths 1 and 2 in Fig. 14.1. 
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The contribution from path 1 is 

and the contribution from path 2 is 

(“7 s ) a >,+(“;;) a '* v . 02 ) 

That the contributions (31) and (32) are identical is easily shown by 
substituting 

A r _y, = Ay,.,. + A r+1 y,_ 1 
in (32) and using the fact that 



Since the contributions (31) and (32) are identical, then each is equiva¬ 
lent to 


^• + a >~](”7 5 H[(" 


-s+l\ 
r+1 ) 




Examination of (33) shows that the step from A r_1 j, to A r+1 j',_ 1 can 
be taken by the horizontal path 3 of Fig. 14.1 providing the following 
two rules are obeyed: 

Rule 3 If in a horizontal step from left to right, the path passes through 
a factor then add the product of that factor and the average of the 
differences immediately above and below. 

Rule 4 If in a horizontal step from left to right, the path passes through 
a column entry then add the product of that entry and the average of 
the factors immediately above and below. 
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Finally to justify the second requirement that the same formula is 
obtained by different paths terminating at the same point in the table, 
consider the two paths of Fig. 14.2. 



The contributions are 

Path 1: + i[A j_ x + Aj 0 ] (“) + \ [(" 2 ^ + (2)] 

+ (“J 1 )a>- 1 (34) 

Path 2: yx + (“ 7 *) 4* + (“ ~ A 2 j 0 + A 3 /., (35) 

In (34), put 

A 2 j_! = A 2 j 0 - A 3 j_ l5 
Aj_! = Aj 0 — A 2 J_! 

= Aj 0 - A 2 j 0 + A 3 j_!, 

and in (35), put 

yi=y 0 + A jo, 

Aj x = Aj 0 + A 2 j 0 , 
then the contributions are 

Pa U ,l ;Jo+ (“)A /o + i A V .[(“+ 1 ) + (")-(")] 

<3, 

Path 2: y a + Ay, [l + (“ 7 ')] + [(“ 7 ') + (” 2 *)] 

+ (")a 3 J-i (37) 
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The coefficient of A 3 j>_! in (36) is 

1 T (“ + 1 )» . 2(« + 1X«X« ~ 1) _ «Qi - 1) 1 

2 l U ~ 2 + 6 2 J 

= 1 j «(« - 1)(« - 2) j 



which is the coefficient in (37). It is left to the reader to show that the 
other coefficients of (36) and (37) are identical. 

14.5 THE INTERPOLATION FORMULAE 

Using the lozenge diagram of Table 14.5 and the four basic rules it is 
now a simple matter to derive any interpolation formula by choosing 
a path through the table from left to right. Note that a step from right 
to left can be included if the product is subtracted instead of added. 
Of the many formulae that can be obtained, four have already been 
mentioned, the two Newton-Gregory and the two Gauss formulae (see 
(27), (28), (29), (30), of section 14.4). 

It has been noted that the Newton-Gregory formulae are useful for 
interpolation near the ends of the table. The Gaussian formulae are 
useful near the centre of the table but are superseded by two other 
formulae which proceed through the table via horizontal paths. 

14.5.1 The Stirling formula 

This is a formula obtained from the path starting at_y 0 (see Table 14.5) 
and proceeding through the table on a horizontal path. Using rules 2, 
3, and 4, 

y = y ° + 5 (?) v*-' + +? [(“ t ') + ( 2 )] A ’y- 

+ 5 (“3 ') [AV. + A’y-J 

•sidVt:')]**-*- 

u(Ay_! + Ay 0 ) u 2 u(u 2 - 1) (A 3 y_ 2 + 

To + 2 '2 ^T-i '31 2 

+ “ 2( “ 2 4 r.- Ai y -2 + — (38) 

Note that this is just the average of the two Gauss formulae (29) and 
(30) of section 14.4. 
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14.5.2 The Bessel formula 

This interpolation formula is obtained by taking a path that starts 
midway between the entries y 0 and y ± and proceeds on a horizontal path 
through the table. 



KEY 

A/, = Newton forward G 2 = Gauss backward 

N z = Newton backward S = Stirling 

G, = Gauss forward B = Bessel 

Fig. 14.3 

From Table 14.5, using rules 2, 3, and 4, 

= I0-. + J-0 + j [(") + (“ 7')] Ay. 


i(“) [ A^ 1 + A-y. 1 + i[(“7 1 ) + (“)]Ar. 
(“ 7 ') [AV- + AV-J + . . • 


T 2 

-H Z ‘ + ( “-»A^ + 

u(u - j)(n - 1) 


u(u - 1) (A 2 );,! + A 8 y 0 ) 
2 2 


+ - 


3! 


A 3 /_i + . . 


(39) 


A particularly simple form of (39) may be used to compute the value 
of y midway between the given values y 0 and y x . With u = 1, the 
coefficients of all the odd differences are zero and the formula becomes 
v = yo+yi _ 1 A *y_i + A 2 y 0 3 Ay_, + A«y, 1 
7 2 8 2 + 128 2 + ' 

, , 1Nn [1 - 3.5 . . . . (2n - l)] 2 A 2n y_ n + A 2 *y_ n+ 

+ 1 ] 2 2n (2n)! 2 


- + . . . (40) 
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The result (40) is called the formula for interpolating to halves. 

The paths taken through the table to derive the named formulae of 
interpolations are shown together in Fig. 14.3. 

Example (i) Use Stirling’s formula to compute tan (89° 26') given the 
following table of values of tan x. 
x 1 89° 21' 89° 23' 89° 25' 89° 27' 89° 29' 

tan x | 88 14 92-91 98-22 104-17 110-90 

Solution Form the difference table and apply formula (38) with 
* = 89° 26', x 0 = 89° 25', h = 2', and u = 0-5. 

x y Ay A 2 y A 3 y A *y 

89° 21' 88-14 

4- 77 

89° 23' 92-91 0-54 

5- 31 0-10 

89° 25' 98-22 0-64 0-04 

5- 95 0-14 

89° 27' 104-17 0-78 

6- 73 

89° 29' 110-90 

Retaining all decimal places, 
tan (89° 26') = 98-22 + 0-5(5-63) + K0-25)(0-64) 

+ i(0-5)(-0-75)(0-12) + A(0-25)(—0-75)(0-04) 

= 98-220 + 2-815 + 0-080 - 0 007 5 - 0-000 312 5 
^ 101-107. 

Example (ii) Use Bessel’s formula to estimate ^46-24 given the follow¬ 
ing table of y = tyx. 

x I 41 45 49 53 

y = tyx I 3-448 2 3-556 9 3-659 3 3-756 3 

Solution Form the difference table and use formula (39) with * = 46-24, 
* 0 = 45, h = 4-0, and u = 0-31. 

x y Ay A 2 / A 3 y 

41 3-448 2 

0-108 7 

45 3-556 9 -0-006 3 

0-102 4 0-000 9 

49 3-659 3 -0-005 4 


53 


3-756 3 


0-097 0 
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Then, 

^46-24 ^ 3-608 1 + (-0-19)(0-102 4) 

+ «0-31)(—0-69)(—0-005 85) 

4- £(0-31)(—0-19)(—0-69)(0-000 9) 

= 3-608 1 - 0-019 456 + 0 000 626 + 0-000 006 
^ 3-589 3. 

Example (iii) Form a cubic polynomial that may be used to replace 
sin x in the range 0-90 < x < 1-20 given the four values 


X 

0-90 

1 00 

1-10 

1-20 

sin x 

0-783 3 

0-841 5 

0-891 2 

0-932 0 

Solution Form the difference table, 



X 

y 

a 7 

A 2 v 

A 3 y 

0-9 

0-783 3 

0-058 2 



1-0 

0-841 5 

0-049 7 

-0-008 5 

-0-000 4 

M 

0-891 2 

0-040 8 

-0 008 9 


1-2 

0-932 0 




and use Bessel’ 

s formula (39) with x 0 

= 1-0, h = 

0-1, then, 


sin (1 + 0-l«) = 0-866 35 + (u - J)(0-049 7) 

+ i«(« - 1)(—0 008 7) 

+ Mu ~ i)(u - IK-0 000 4) 

^ 0-841 50 + 0-054 02« - 0 004 25u 2 - 0 000 07u 3 . 
Putting u = lOv then, 

y = 0-841 50 + 0-540 2v - 0-425i> 2 - 0-07t; 3 

may be used as the cubic expression to represent sin (1 + v) in the 
range —0-1 < v < 0-2. Finally, replacement of v by x— 1 gives the 
cubic polynomial in x that represents sin x. 

14.5.3 The accuracy of the formulae 

Each formula derived for the function f(x 0 + uh) is an approximating 
polynomial. If the actual function f(x) is a polynomial in x then, 
providing the difference table can be taken far enough (i.e. that enough 
data are given and used) all the formulae are equally accurate and will 
give an exact result (assuming the original data to be exact). 

If the function f(x) is not a polynomial then the difference table 
should first be examined to see that the successive differences A_y ( , A 2 /,, 
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A 3 _y„ . . . decrease, which means that successive terms in the poly¬ 
nomial approximation also decrease. Some indication of the accuracy 
to be expected is often given by an examination of the difference 
columns. If the differences decrease rapidly and the rth difference 
column is approximately constant then it is reasonable to expect that 
the formulae will give results as accurate as the given data. If the 
differences slowly decrease or there is no column of (approximate) 
constants then the reader should be aware that a problem of accuracy 
does exist. When the differences increase it is highly probable that the 
interpolation is impossible, the formula used being part of a divergent 
series. Interpolation in the latter case might be possible if the differences 
increase so slowly that the binomial coefficients in the formulae are 
able to control the size of the terms and ensure a convergent series. 
The difficulty of knowing whether a result is approximately correct or 
not is usually removed by an examination of the remainder term of the 
interpolating series used. The discussion involved in this examination 
is not considered here. 

The choice of interpolating polynomial depends on the position in 
the table of the interpolated value. It is usual to keep u as small as 
possible but to use a path through the table that gives a polynomial 
formula using all the available information. The following statements 
read in conjunction with Fig. 14.3 may be taken as general rules: 

Rule 1 If x 0 is at the start of the table, i.e. y is known for argument 
values x 0 , x u x 2 , . . ., and interpolation is required near the beginning 
of the table then the best interpolation formula is the Newton-Gregory 
forward formula. 

Rule 2 If x 0 is at the end of the table, i.e. y is known for argument 

values x 0 , x- lt x. a .and interpolation is required near the end of 

the table then the best result is obtained from the Newton-Gregory 
backward formula of interpolation. 

Rule 3 If x 0 is in the centre of the table, i.e. y is known for argument 
values on either side of x 0 and interpolation is required near the centre 
of the table then the best formula to use is either that of Stirling or of 
Bessel. 

If interpolation is required at either side of x 0 for the range 
— i < u +J, then the Stirling formula should be used. 

If interpolation is required near the middle of the interval ( x 0 , JCj) 
for the range J < u < f, then the formula of Bessel should be used. 

Example Form the difference table from the given data for y = sin x° 
and estimate y when (i) x = 12, (ii) x = 31, (iii) x = 37, (iv) x — 45. 
x I 10 20 30 40 50 60 

y I 0-173 65 0-342 02 0-500 00 0-642 79 0-766 04 0-866 03 
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Solution In the following difference table the decimal point is under¬ 
stood, the fifth difference for example being 0 000 18. 


* 

y 

A y 

A 2 y 

A 3 y 

A *y A 6 / 

10 

17365 

16837 




20 

34202 

15798 

-1039 

-480 


30 

50000 


-1519 


45 



14279 


-435 

18 

40 

64279 

12325 

-1954 

-372 

63 

50 

76604 

9999 

-2326 



60 

86603 





(i) For 

x = 12, 

take x 0 = 

10 then, since 

h = 10, 

u = 0-2 and the 


section 14.4 

y( 12) 0-173 65 + (0-2)(0-168 37) + (-0-010 39) 

+ ™ ( —0-004 80) 

+ (0-2)(—0-8)(—1-8)(—2-8) (Q-0()0 4 g } 


(02)(—0-8)(—1-8)(—2-8)(—3-8) 


120 


( 0-000 18 ) 


= 0-173 65 + 0-033 674 + 0 000 831 2 - 0-000 230 4 
- 0-000 001 5 + 0 000 004 5 
= 0-207 93 

The result correct to five decimal places is known to be 0-207 91. The 
estimate is in error by less than 0 01 per cent. 

(ii) For x = 31, take x 0 = 30, h = 10, then u = 0-1 and the Stirling 
formula (38) of section 14.5.1 gives 

j(31) ^ 0-5 + (0-l)(0-150 385) + (—0-015 19) 


+ 


(0-l)(—0-99) 


(0-004 575) + 


(0-01)(- 0-99) 
24 


(0 000 45) 


= 0-500 00 + 0-015 038 5 - 0 000 075 95 
- 0-000 075 49 - 0 000 000 19 
= 0-514 89. 
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The correct result is 0-515 04 and the estimate is in error by less than 
0-03 per cent. 


(iii) For x = 37, x Q = 30, h = 10, then u = 0-7 and Bessel’s formula 
(39) gives 

7(37) ^ 0-571 395 + (0-2X0-142 79) + (0 ' 7 ><~°' 3 > (_ 0 -017 365) 


(0-7)(0-2)(—0-3) 


(-0-004 35) 


(0-7)(-0-51)(-l-3) 

24 

(0-7)(0-2)(—0-51)(—1-3) 
120 


(0 000 54) 


(0-000 18) 


= 0-571 395 + 0-028 558 + 0-001 823 325 
+ 0-000 030 45 + 0-000 010 442 + 0 000 002 
= 0-601 82. 

The correct result, to five decimal places, is 0-601 81 and there is an 
error less than 0-002 per cent in this estimate. 

Using the Lozenge diagram the reader should verify the derivation 
of the sixth term in the formula. 

(iv) For x = 45, x 0 = 40, h — 10, then u = 0-5 and using Bessel’s 
formula (40) gives 

7(45) =cz 0-704 415 - £(-0 021 40) + ih (0 000 72) 

= 0-704 415 + 0-002 675 + 0-000 017 
= 0-707 11. 

In deriving the average coefficient 0-000 72 of the third term of 7(45) 
it has been assumed that A s 7 , is constant at 0-000 18, i.e. the coefficient 
0-000 72 is the average of 0-000 63 and 0-000 81. The estimated value is 
correct to five decimal places. 

Finally it is possible to give an indication of the superiority that the 
central difference formulae of Stirling and Bessel have over the Newton- 
Gregory formulae by a comparison of the coefficients of the terms. 
Consider the case of u = 0-3 in the formulae (27) of section 14.4, (38) 
of section 14.5.1, and (39) of section 14.5.2. The three formulae based 
on the value x = x 0 are as follows: 

Newton-Gregory: 

7(x 0 + 0-3 h) = 7o + 0-3A7 o - 0-105A 2 7 o + 0-059 5A 3 7„ 

- 0-040 16A 4 7o + . . . 
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Stirling: 

y{x o + 0*3/t) =y 0 + 0-3 (Aj - 1 2 + Ayo) + 0-045A 2 j_, 

- 0-045 5 ^ - 0-003 412 5A 4 y_ 2 


Bessel: 

y(x 0 + 0-3/2) = _ (0-2)Aj 0 - 0-105 (A2 - V - 1 2 + A2yo) 

+ 0-007A 3 /., + 0-019 3 * A4j; ~ 2 2 A4 - y ~ l) + . . . 

It can be observed that after the first few terms the coefficients in the 
Stirling formula decrease more rapidly than those of the Bessel formula 
which in turn decrease more rapidly than those of the Newton-Gregory 
formula. The contribution from the higher order terms of the Stirling 
formula is much less than that of the Newton-Gregory formula. If the 
differences are only decreasing slowly or if there are round-off errors 
in the initial data (which are magnified in the higher difference columns, 
see section 14.2.1) it is preferable, whenever possible, to use a central 
difference formula such as that of Stirling or Bessel. 

14.6 INVERSE INTERPOLATION 

Inverse interpolation applied to a set of tabulated values (x lt y t ), 

i = — n .— 1,0, +1,. . ., +n requires an estimation of x for some 

given y in the range < y < j +n . In essence the process reduces to 
solving the interpolation formula y — p(x) for x, given the value of y. 
The formulae of section 14.5 have all been expressed in the form 
y = p{x 0 + uh) and it usually easier to solve for u and then to determine 
x. The process is simply that of determining the root of a polynomial 
equation, a process discussed in some detail in Chapter 13. 

The interpolation formula used depends on the position in the table 
of the value required. Only one method of attack is discussed here 
namely the iterative method of section 13.2.2. The equation p(x 0 + uh) 
= constant is written in the form 

u=m 

and the iterative process 

«r + l = <£(«r) 

is used. 
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Example Find the value of x for which y = 10 in the following table 
x I 2-8 2-9 3-0 3-1 3-2 

y I 8-252 7 9-114 6 10-067 7 11-121 5 12-286 6 

Solution Form the difference table 

x y Ay A 2 y A 3 / A 4 y 


2-8 

8-252 7 






0-861 9 



2-9 

9-114 6 


0-091 2 




0-953 1 


0-009 5 

3-0 

10-067 7 


0-100 7 




1-053 8 


0-010 6 

3-1 

11-121 5 


0-1113 




1-165 1 



3-2 

12-286 6 





Because the value y = 10 is near the centre value the best interpola¬ 
tion formula to use is that of Stirling. With x 0 = 3, h = 0-1, and 
x = 3-0 + 0-lw then equation (38) leads to the approximation 
10 = 10-067 7 + «(1 003 45) + k 2 (0-050 35) 

+ — 3 ~ 1} (0-01005) + “ 3( ^~ 1} (0-001 1), 


which may be rewritten in the form 


0-067 7 , a (0-050 35) 
1-003 45 + U 1-003 45 


+ u(u 2 - 1) 


(0-001 675) 
1-003 45 


+ « 2 (« 2 - 1 ) 


(0 000 046) 
1-003 45 


= m 


For the purposes of calculation, put u = —v then 
v = 0-067 467 + z> 2 (0-050 177) + u(l - u 2 )(0 001 669) 

- i> 2 (l - u 2 )(0-000 046) 

and the iteration proceeds as follows 

v 0 = 0 v 3 = 0-067 810 6 


Vi = 0-067 467 Vi = 0-067 810 6 

u 2 = 0-067 807 

The value of u may therefore be taken as —0-067 81 from which 
x = x 0 + uh = 2-993 2 to four decimal places. The table is, in fact, 
one for y = cosh x and the result correct to four decimal places is 
known to be 2-993 3. The error in the calculated value is 0-003 per cent. 
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14.7 NUMERICAL DIFFERENTIATION 

This is the process of calculation of the derivative dyjdx from given 
data (x„ y t ), when the actual relationship between x and y is unknown. 
The procedure is to replace the true relationship y = fix) by the best 
approximating polynomial y = p{x) and then to differentiate the latter. 

The interpolation polynomial to be used depends as usual on the 
position in the table of the value at which the derivative is required. 

The interpolation formulae obtained in the previous sections have 
all been expressed in terms of the variable u rather than x, in the form 
y(x) =/(xo + uh) mpix) 
and differentiation gives 

dy _ dy du _ 1 dy 1 dp 
dx dudx~ hdu h du 


(41) 

(42) 


Further derivatives may be obtained, for example 
d 2 y^ 1 d 2 p 
dx? — ¥du*‘ 


(43) 


Example From the data given, compute (i) dy/dx when x = 3-21, 
(ii) d 2 y/dx 2 when x = 3-8. 


X 

3-0 

3-5 

4-0 

4-5 

y 

1-484 30 

1-550 23 

1-607 46 

1-658 01 

Solution Construct the difference table 



X 

y 

Ay 

A 2 y 

A 3 y 

3-0 

1-484 30 

0-065 93 



3-5 

1-550 23 

0-057 23 

-0-008 70 

0 002 02 

4-0 

1-607 46 

0-050 55 

-0 006 68 



4-5 1-658 01 


(i) To find dy/dx at x = 3-21 near the start of the table use the Newton- 
Gregory formula (27) based on x 0 = 3, h = 0-5, and u = 0-42 then, 


y — 1-484 30 + u(0 065 93) + 
. u(u - 1)(» - 2) 


u(u - 1) 


(-0 008 70) 


(0 002 02 ) 


and with (42), 

0-5 dy/dx — [0-065 93 - (2 u - l)(0-004 35) 

+ (3m 2 - 6m + 2)(0-000 336)] u=0 42 
= 0 065 93 + 0-000 696 + 0-000 003 
= 0-066 629. 
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Thus dy/dx =c= 0-133 2, when x = 3-21. 

(ii) To find d 2 y/dx z at x = 3-8 near the centre of the table use the 
Bessel formula (39) based on x 0 = 3-5, h = 0-5, and u = 0-6, then 

y — 1-578 845 + (u - i)(0-057 23) + \u(u - 1)(—0-007 69) 

+ K«X«-iX«- 1X0-002 02) 

and 

0-5 dyjdx — 0-057 23 + (m - |)(-0-007 69) 

+ K3u 2 - 3m + i)(0 002 02) 

and so, 

(0-5) 2 ^ — [-0-007 69 + (m - *)(0 002 02)] u _ O6 

or [ 3 ] i = 3. 8 - (_0 ' 007488)/0 ' 25 

= -0-029 9(52). 

The given data are those of the function 

y — logm (lOx + 0-5) 

from which 

0 ,33 22 and [ S ]„ a< -- 0 ' 02930 

and the estimated value in (i) is in error by approximately 0-03 per cent 
while that in (ii) is in error by over 2 per cent. 

The process of successive differentiation has the effect of exaggerating 
the errors involved in the original approximations, i.e. if the data are 
accurate to a specified number of decimal places then the derivative 
obtained by the above process is usually accurate to a lesser number of 
decimal places. 

14.7.1 Maxima and minima 

To find the approximate turning value of a function it is necessary to 
form the approximate derivative, equate to zero and solve the resultant 
equation for u (and hence for x). 

Example Find the turning point, given the following data, 
x I 2 3 4 5 6 

y I 31-187 5 12-027 5 2-865 3 3-705 2 14-544 0 
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Solution Form the difference table in the usual way. A rough sketch 
shows that there is a minimum in the region jc = 4-5. 


X 

y 

Ay 

A 2 / 

A 3 y 

A *y 

2 

31-187 5 

-19-1600 




3 

12-027 5 

-9-162 2 

9-997 8 

0-004 3 


4 

2-865 3 

0-839 9 

10-002 1 

-0-003 2 

-0-007 5 

5 

3-705 2 

10-838 8 

9-998 9 



6 

14-544 0 





Using Bessel’s 

formula (39) of section 14.5.2 with x 0 = 

4-0, h = 1, 


y ^ 3-285 25 + (u - £)(0-839 9) + 


(»)(« ~ 1) 


(10-000 5) 


+ u( „- i )(— d ( _ 0 0032) 


and A 0-839 9 + (u - JX10-000 5) + <3u‘ -3a + })(-0000 53). 

An approximation to the value of u corresponding to the minimum 
point is given by solving the approximate equation 

0-001 6« 2 - 10-002 In + 4-160 6 = 0. 

Writing this in the form 

n = 0-416 0 + 0-000 16u 2 

the approximate root is 0-416 from which it follows that x — 4-416. It 
is left to the reader to show that the corresponding minimum value is 
given by y = 2. 


14.8 NUMERICAL INTEGRATION 

This is the process of forming a definite integral of y from data given 
by a set of tabulated values of x and y. The process is frequently called 
integration by quadrature. The data may be experimental, in which case 
the actual relationship y = f(x) is unknown but the integral f f(x) dx 
is required over some range of argument within the range of given data. 

Frequently however it is necessary to compute the integral J f(x) dx 
for a known function f(x) for which the methods of integration dis¬ 
cussed in Chapter 6 do not apply. In this case the values of f(x ) are 
tabulated at intervals of the argument x within the range (a, b ) so as to 
form the required data. 
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An interpolating polynomial is now used to replace the function f(pc), 
and the approximation obtained by term by term integration. Instead 
of using one polynomial to replace fix) in the whole range (a, b) the 
usual procedure is to write 

J fix) dx = J fix) dx + J fix) dx + ... + j fix) dx 

where a < < a a < ... < a r < b and then to replace fix) of each 

sub-range by interpolation polynomials of the same kind. The error in 
the approximation depends on how closely the interpolation poly¬ 
nomials approximate the function fix). Since integration is a summation 
process it is reasonable to suppose that errors in the original data, some 
of which may be positive and some negative, tend to be smoothed out. 

There are many different kinds of quadrature formulae each based 
on integration of a particular kind of interpolating polynomial. The 
simplest to use and often the most effective are those based on the 
forward Newton-Gregory polynomial. 

14.8.1 Quadrature formulae based on the Newton-Gregory polynomial 

Consider a function y given at n equal intervals of the argument x from 
x 0 to x 0 + nh then, since x = x 0 + uh, dx — hdu, the integration of 
the forward Newton-Gregory formula (27) leads to 

n * - * £„[•» + (r) 4 * + G) 4 * + (;) 4 ’*+• • • 

(“)** + ...]* ( 44 ) 

The general term of equation (44) involves the integration with respect 

^ — J X« — 2) . . . (w — r + 1) du. 

It is possible to integrate such an expression for arbitrary r but for the 
simple quadrature formulae based on equation (44) there is no need for 
such generality. 

Suppose it is required to evaluate the integral affix) over an interval 
from x = a to x = b. The first step is to subdivide the interval (a, b) 
into a number of equal parts. If there are N such parts the argument 
values may be written 

a = x 0 , x u x: 2 , . . ., x N = b 
with x, — x r _j = h and b — a — Nh. 
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The second step is to take the argument (and ordinate) values in 
groups of p at a time, i.e. 

(* 0 , x u . . ., (xp_!,. . x 2 p -i)‘, . . . ( . . . x N ) 

and then to apply the result (44) to each group in turn with n—p— 1. 
A different integration formula will result for each different value of p. 
The most useful results are those obtained with p = 2, p = 3, and 
P = 7. 

14.8.2 The trapezoidal rule 

This is the case with p = 2, when the N + 1 argument values are 
grouped in pairs (x 0 , x x ), (* b x a ) . . . (*„_!, x N ) and equation (44) is 
applied with n = 1. The polynomial in the integrand of equation (44) 
is assumed to be of the first degree, all differences after the first being 
taken as zero. Thus 

ydx^h £ Oo + uky 0 )du = h(y 0 + y 0 ) 

= h[y 0 + Kfi - To)] 

= POo + yi), 
ydx^ P(Ti + y 2 ). 


ydx^i_h(ys-i +Tv)- 

Hence 

J y dx — j ydx = POo + 2y 1 + 2y 2 + . . . + 2yu-\ + Tv)- 

(45) 

This is the trapezoidal rule. It is a simple result to apply but is not 
particularly accurate unless h is made extremely small (and N very 
large). Geometrically, the process of integration 

JV)^ 

can be represented as the area under the curve y = f{x) and the trape¬ 
zoidal rule simply replaces the curve by a set of chords through the 
points (x„Tr) and (x r+1 , y r +i) with r = 0,1,. . ., N — 1 and then 
approximates to the actual area under the curve by adding together 
the trapezium areas as in Fig. 14.4. 



NUMERICAL METHODS II 


385 


14.8.3 Simpson’s rule 

This is the case with p — 3, for which the TV + 1 argument values are 
grouped, three at a time, i.e. 

(*o, *i, * 2 ); (x 2 , x 3 , x 4 ); . . ., (x w _ 2 , **_!, x N ) 

It is evident that TV, the number of intervals, must be even and the 
number of ordinates must be odd. The integration (44) is applied to 



each group with n = 2. The integrand is assumed to be of the second 
degree all differences after the second being omitted. This leads to 

J v dx =£= h J [y 0 + uAy 0 + \u{u — l)A 2 _y 0 ] du 

= h[2y 0 + 2A_y 0 + JA 2 ^] 

= h l 2 y° + 2 Oi - To) + i(y 2 - 2y 1 + _y 0 )] 

= i%o + 4 y x + y 2 ] 

and to 

J x ydx— 5 fj 2 + 4j 3 + j 4 ]. 


•L ydx ~\ [yN ~ 2 + 4 yi, ~ i + y "l- 
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Adding the results gives 
h 

y dx — ^ + yti) + 2(^2 + y* + ■ • . + 

+ 4(yi + + • • • + J'w-i)] (46) 

This simple and reasonably accurate formula is known as Simpson's 
rule. The reader should note the grouping of the terms inside the square 
bracket in equation (46): the sum of the first and last ordinates, twice 
the sum of the even ordinates, four times the sum of remaining odd 
ordinates. 

Geometrically the process consists of passing arcs of second degree 
polynomials, i.e. parabolas with axes parallel to the y-axis, through 
successive trios of points and evaluating the area under these parabolas 
as an approximation to the area under the actual curve of y = f(x). 

14.8.4 Weddle’s rule 

This is the case of p = 7 (the intermediate cases p = 4, 5, 6 lead to 
formulae that lack simplicity), for which the N + 1 argument values 
are grouped seven at a time, i.e. 

(* 0 , x u x 2 ,. . ., x 6 ); (x 6 , • • x la );...(..., x N ). 

Here N must be an integral multiple of six. The integration (44) is now 
applied to each group with n = 6 so that in (44) all differences after 
the sixth are assumed to be zero. Then, for the first group 

J* ydx^hj jy 0 + uAy 0 + \u{u - l)A 2 y 0 

+ £«(« - 1)(« - 2)A 3 y 0 + ... + (“) A e y 0 ] du 
= h[6y 0 + 18Ay 0 + 27A 2 y 0 + 24A 3 y 0 + WA 4 y 0 
+ itA 5 y 0 + -iVo A 6 y 0 ], 

Before proceeding with the substitution in terms of the ordinates 
To, yi, y 2 , ■ ■ ye, the last term is amended to (3A 6 y 0 )/10 instead of 
(41A 6 y 0 )/140. This introduces an error of (AA 6 y o )/140 into the evalua¬ 
tion, which will be insignificant providing that h and A 6 y 0 are small. 
Replacing the differences by their expressions in y 0 ,yi,yz, ■ ■ • ye leads, 
after some reduction, to 

ydx^ -roh(y Q + + y a + 6y a + y 4 + 5y 0 + y 6 ). 

Similarly, for the second group of terms 

£ ydx^ -foh(y 6 + 5 yr, + y B + 6 y a + y 10 + 5y ±1 + y 12 ) 
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Repeating for each successive group of terms and adding leads to 

1 ydX = I " ydX 

= -hh [i c t y^ (47) 

where 

c, = 1, 5, 1, 6, 1, 5; 2, 5, 1, 6,1, 5; 2,5; 

2, 5, 1, 6, 1, 5, 1. 

This is Weddle's rule. The coefficients c, are most simply remembered 
in groups, i.e. 

first group: 1, 5, 1, 6, 1, 5 
all interior groups: 2, 5,1, 6, 1, 5 
last group: 2, 5,1, 6, 1, 5,1 

The geometrical interpretation is that the portion of the curve 
y = fix) through each successive group of seven points is replaced by 
an arc of a sixth degree polynomial through these seven points. The 
rule is more accurate but requires more calculation than does Simpson’s 
rule. 

14.8.5 Quadrature formulae based on other interpolation polynomials 
Further formulae can be produced dependent on the integration of the 
Stirling, Gauss, and Bessel interpolation polynomials. These results, 
while providing rapidly converging quadrature formulae, have the dis¬ 
advantage of difficulty in practical application. Such formulae are not 
discussed here partly because of this disadvantage and partly because 
they are superseded by more sophisticated techniques which have 
application to computer programs. 

14.8.6 Application of trapezoidal, Simpson’s and Weddle’s rule 

As stated earlier if, as the result of an experiment, data are available 
in the form (x t , y,), i = 0, 1,2,. . ., N then an approximation to the 
integral of f{x) from x 0 to x N can be obtained by immediately applying 
the results of the previous sections. If it is required to evaluate the 
integral of some known function then the data must be computed before 
proceeding. In each case the accuracy depends, in geometrical terms, 
on the accuracy with which the areas under the polynomials approxi¬ 
mate to the areas under the actual curve. An estimate of the error 
involved may be easily found. 

In the case of the trapezoidal rule the principal part of the error in 
each integration is the first term neglected in the integral (44), i.e. the 
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is the first term of 

‘f{6K+GK+- 

or 

The total error in the use of the trapezoidal rule is 

(A 2 / 0 + A^ + . . . + A%_0 (48) 

In using Simpson’s rule the actual error in the integral from x 0 to 
x a is 

h i {(3) A3/o + 0 A4y ° + ' ‘ •} du 

N °w, j* 2 ^ du = J] £* «(« - 1X« - 2 ) du 

= jj [4 - 8 + 4] 

= 0 

and the principal part of the error is given by 

*ro *** 

which reduces to — ^ A 4 y 0 . 

The total error involved in the use of Simpson’s rule is then 

- ^ [ A4 To + A 4 j 2 + Ay 4 + . . . + A%_ 2 ], (49) 

The fact that the error depends only on the fourth order differences 
means that Simpson’s rule will be exact when f(x) is a polynomial of 
the first, second or third degree. 

In applying Weddle’s rule the principal error arises from the omission 
of the term — ^ A%. 

The total error is then 

- [A 6 Jo + A6 Te + ■ • •] (50) 

Weddle’s rule will be exact for the integral of fix) when the latter is a 
polynomial of up to the fifth degree. 
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This chapter is concluded by examples of evaluation of an integral 
by each of the three rules. 

Example (i) Estimate the value of J sin x dx using a sub-division of 

OT in the argument, (i) by the trapezoidal rule, (ii) by Simpson’s rule, 
and (iii) by Weddle’s rule. Compare the results with the exact value 
obtained by integration. 

Solution This example is used merely as an illustration of the numerical 
processes; the correct value of the integral may be obtained using tables 
of trigonometric functions, i.e. 

J sin x dx — J^—cos xj 

= 1 — cos 0-6 
= 1 - 0-825 34 

= 0-174 66 to five decimal places. 

To find estimates by the numerical processes, a table of values is 
required. This is given below to five decimal places, 
xr I 0 0-1 0-2 0-3 0-4 0-5 0-6 

sin x I 0-000 00 0-099 83 0-198 67 0-295 52 0-389 42 0-479 43 0-564 64 

(i) Evaluating by the trapezoidal rule (45) of section 14.8.2 with N — 6, 
h = 0-1, gives an approximate value of 

K0-l)[0-564 64 + 2(1-462 87)] = K0T)(3-490 38) 

= 0-174 52 to five decimal places. 

(ii) Evaluating by Simpson’s rule (46) of section 14.8.3 with N = 6, 
h — 0T gives an approximate value of 

J(0-l)[0-564 64 + 2(0-588 09) + 4(0-874 78)] = J(0-l)(5-239 94) 

= 0-174 665 

or 0-174 67 to five decimal places. 

(iii) Evaluating by Weddle’s rule (47) of section 14.8.4 with N = 6, 
h = 0-1 gives an approximation of 

-A(0‘l)[0-0 + 0-499 15 + 0-198 67 + 1-773 12 

+ 0-389 42 + 2-397 15 + 0-564 64] = -A(0-l)(5-822 15) 

= 0-174 664 

or 0-174 66 to five decimal places. 

The first two results are in error by approximately 0-08 per cent, 0-006 
per cent. The evaluation by Weddle’s rule is correct to five decimal 
places. 
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Example (ii) Evaluate the integral J exp (—x 2 ) dx using a sub¬ 
division of 0-2 in the argument and estimate the error involved. 

Solution This integral appears in statistical analysis and cannot be 
evaluated by using the standard methods of integration. Form the 
difference table for y = exp (—x 2 ) from x 0 = 0 to x N — 1-2 at equal 
intervals of h = 0-2. 


0 

0-2 

0-4 

0-6 

0-8 

10 

1-2 


y 

1 000 0 
0-960 8 
0-852 1 
0-697 7 
0-527 3 
0-367 9 
0-236 9 


-0-039 2 
-0-108 7 
-0-1544 
-0-170 4 
-0-159 4 
—0131 0 


A *y A 3 y 


-0-069 5 
-0-045 7 
-0-016 0 
+0-0110 
+0-028 4 


0-023 8 
0-029 7 
0-027 0 
0-017 4 


A l y 


0-005 9 


-0-009 6 


A s y 


-0-008 6 
-0-006 9 


A«y 


+0-001 7 


(i) The trapezoidal rule 

From equation (45) of section 14.8.2 with h = 0-2, N = 6, the approxi¬ 
mate value of the integral 

J exp (—x 2 ) dx 

is given by 

0-1 [1-236 9 + 2(3-405 8)] = 0-804 85. 

(ii) Simpson’s rule 

From equation (46) of section 14.8.3 with h = 0-2 and N = 6 the 
approximation to the integral is 
i (0-2)[l-236 9 + 2(0-852 1 + 0-527 3) 

+ 4(0-960 8 + 0-697 7 + 0-367 9) 

= i (0-2)[12-101 3] 

= 0-806 7(54). 

(iii) Weddle’s rule 

From equation (47) of section 14.8.4 with h = 0-2 and N = 6 the 
approximation to the integral is given by 
0-06(1-000 0 + 4-804 0 + 0-852 1 + 4-186 2 

+ 0-527 3 + 1-839 5 + 0-236 9) = 0-806 7(60). 
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An estimation of the error involved in each case may be obtained 
from the equations (48), (49), and (50) and the difference table. From 
(48), the estimation of the error in the trapezoidal rule is 
—(0-2)(—0-091 8)/12 = +0-001 5(3) 
and 0-806 4 should be a better result than 0-804 8. 

From (49), the estimation of the error in Simpson’s rule is 
—(0-2)(—0-006 4)/90 = +0-000 0(14) 

which is zero to four places of decimals. Since round-off errors in the 
entries are propagated into the higher differences the result estimated by 
Simpson’s rule is probably between 0-806 7 and 0-806 8. 

Similarly from (50) the estimated error in Weddle’s rule is 
-(0-2X0-001 7)/140 = -0 000 0(02) 

and the result cannot be improved (to four decimal places). 

From a practical point of view the most reliable method of improving 
an estimation is to sub-divide the interval of the argument and recalcu¬ 
late the approximation. An examination of the two results will immedi¬ 
ately show how many decimal places may be claimed with accuracy. 
If in the present example the values of exp (— x 2 ) are calculated to four 
decimal places at the values of x — 0-1, 0-3, . . ., 1-1, inserted in the 
table and Simpson’s rule used with 13 ordinates, the estimated value 
is found to be 0-806 7(36). The conclusion would be that, with the data 
given, the integral has the value 0-806 7 to four decimal places. 

This integral can be evaluated by other means and its value is 
0-806 74 correct to five decimal places. Any difference between the 
estimated value using Simpson’s rule (or Weddle’s rule) and the actual 
value would be due entirely to round-off errors in the column of entries. 

14.9 GENERAL REMARKS 

In any numerical work it is worth noting that to avoid introducing 
further errors the intermediate calculations are performed using as 
many decimal places as possible, or convenient, but that the result of 
all the calculations is rounded off to no more than the number of 
decimal places in the given data. If interpolation is required in a table 
given to four decimal places, the calculations are performed using at 
least five decimal places and the result rounded off to four decimal 
places. It is impossible to claim a result correct to more decimal places 
than are given in the original information. 

The original data frequently have round-off errors present and as in 
section 14.2.1 these errors are magnified in the higher differences. The 
result of a calculation may therefore have an error in the final decimal 
place claimed. Refer also to sections 13.5 and 13.6. 

The calculations in this, and the next, chapter have been performed 
using a desk calculator and, as the reader will find on carrying out these 
or similar calculations, the mistakes that will occur are due to the 
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operator and not the machine. The commonest mistakes made are tc 
enter the wrong number in the machine, e.g. 8432 instead of 8342, or 
to fail to account correctly for a decimal point. Such mistakes can only 
be avoided by checking each entry made and formulating a definite 
routine for each calculation. For a final check on the work it may be 
necessary to recalculate completely or, preferably, have someone else 
repeat the calculations. 

Familiarity with a machine is the best aid to correct calculations and 
this can only come with practice. 

EXERCISE 14 

1 There is a cubic relationship between x and y given in the following 
table. Show that there are two errors in the table, locate them and 
correct the table. 


X 

0 

1 

2 

3 

4 

5 

y 

97-336 

132-651 

175-616 

226-981 

287-496 

357-911 

X 

6 

7 

8 

9 

10 

11 

y 

438-977 

531-441 

636 046 

753-571 

884-736 

1030-301 


2 Form a difference table from the following data and deduce that 
f(x) is a quadratic form. Use the table to find the value of /(0). 
Use the Newton-Gregory forward interpolation formula based on 
x = 0 to find the quadratic function fix). 

x I 2 3 4 5 6 

f(x) | 3-00 4-28 5-88 7-80 1004 

3 Use the backward Newton-Gregory interpolation formula to find 
the cubic function of x which takes the values 2, 11, 32, 71, when x 
has the values —6, —4, —2, 0 respectively. 

4 From the following table use an appropriate interpolation formula 
to estimate/(16-4) and/(23-5) 

* I 16 18 20 22 24 

fix) | 261-3 293-7 330-0 372-2 422-3 

5 The following is an extract from a table of sinh x. Use the forward 
Newton-Gregory interpolation formula based onx = 4-60 to esti¬ 
mate the value of sinh 4-612. Also evaluate using the Stirling 
formula based onx = 4-61. Compare and comment on the results. 

x I 4-60 4-61 4-62 4-63 

sinh x | 49-737 50-237 50-742 51-252 

6 Use an appropriate interpolation formula to estimate the values of 
/(0-3),/(2-55),/(l-75), and/(1-55) given the following data: 

x I 0 0-5 1-0 1-5 2-0 2-5 30 

fix) | 0 0-480 4 0-865 8 1-131 7 1-301 8 1-407 0 1-471 3 
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7 Show that the forward and backward Gauss formulae 

y = To + (“) Ajo + + (“ 3 *) A 3 j_j + . . . 

y - * + (“ 7*) a to + (“) A2 y° + (3) A ^-i + • • • 

are identical if they are truncated by neglecting the fourth and higher 
differences. 

Deduce Bessel’s formula and use it to find cos -1 0-7 to the nearest 
minute given that 

cos 30° = 0-866 0, cos 40° = 0-766 0, 
cos 50° = 0-642 8, cos 60° = 0-500 0. 

8 When x takes the values 12,22, 32, 42, 52, and 62 the corresponding 
values of fix) are 62-146 08, 62-344 11, 62-538 29, 62-728 77, 
62-915 69, and 63-099 18. Evaluate approximately /'(14) and/"(12). 

9 The function fix) has a maximum value for 3 < x < 4. Given the 
following difference table use Bessel’s formula to find the maximum 
value off{x). 

X fix) A A 2 A 3 A 4 

3 1-219 710 -0-137 020 -0-024 872 

-0-014 934 0-062 828 

4 1-204 776 -0-074 192 -0-023 366 

10 The following is a table of fix) = x 2 /(l + * 4 ) from x = 0 to 
x = 1-2 at intervals of 0-1. Find the approximate value of 



using (i) the trapezoidal rule, (ii) Simpson’s rule, and (iii) Weddle’s 
rule. In each case estimate the error involved. 


* 

fix ) 

x 

fix ) 

0 

0 000 000 

0-6 

0-318 697 

01 

0 009 999 

0-7 

0-395 129 

0-2 

0 039 936 

0-8 

0-454030 

0-3 

0-089 277 

0-9 

0-489 101 

0-4 

0-156 006 

1-0 

0-500 000 

0-5 

0-235 294 

1-1 

1-2 

0-491 051 
0-468 506 



15 

Statistics I 


15.0 INTRODUCTION 

The study of statistical methods divides into two parts, descriptive 
statistics which consists in the collection, presentation, and summarizing 
of data and statistical inference which is concerned with drawing con¬ 
clusions about the source of data. 

15.1 STATISTICAL EXPERIMENTS 

In the design of an experiment, two main aims are required. These are 
(i) that it is possible to judge the validity of the conclusions, and (ii) that 
the experiment is efficient in that conclusions may be drawn on as few 
observations as possible. 

A statistical experiment must be capable of being repeated, or be such 
that it can be considered as being repetitive. 

Example (i) An unbiased coin tossed 12 times may produce the result 
HTTTHTHTTHTT 

or 8T, 4 H where T stands for ‘tails’ and H for ‘heads’. The experiment 
of tossing a coin 12 times can be repeated say 6 times to give results as 
follows: 

T I 8 10 11 9 10 9 

H I 4 2 1 3 2 3 

This experiment shows random variation from one experiment to the 
next. 

Example (ii) In an agricultural experiment two types A and B of wheat 
are grown on ten plots with the following yield in kg/m 2 : 

A I 0-29 0-27 0-28 0-27 0-36 0-30 0-28 0-28 0-33 0-29 

B I 0-30 0-31 0-28 0-29 0-27 0-30 0-32 0-30 0-27 0-28 

This experiment is thus repetitive and shows random variation. 

A typical statistical problem occurs in mass production where it is 
necessary to maintain a given standard. For example, a machine is to 
produce ball-bearings of 2 mm diameter, and from a set of fifty observa¬ 
tions it is to be decided whether or not the standard is being maintained. 
It is totally impracticable as well as expensive to measure the whole 
output of the machine. The experiment is repetitive in that many sets 
of fifty observations may be made. Assuming that the machine is set 
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correctly, the measurements from each set of fifty observations will 
show random variation. 

15.1.1 Definitions 

A population is any set of objects about which numerical information 
is required. 

An individual is any member of the population. 

A sample is any set of individuals. 

An attribute is any property which any individual may or may not 
possess and about which information is required. 

Example (i) All the results which would be obtained in an infinite 
number of tosses of a coin is a population. Twelve such tosses constitute 
a sample from this population and each result, H or T, is an individual 
from the population. The attribute considered could be that a head is 
tossed, and the number of times this happened would be counted. It 
would be expected that in the long run, there would be 50 per cent heads 
and 50 per cent tails. 

Example (ii) The heights of all men in England is a population of 
heights. The attribute under consideration might be the average height, 
or the maximum or minimum height. A sample from this population 
might be the heights of 100 men chosen at random, and the statistical 
problem to infer properties of the population from the sample. 

15.2 PROBABILITY 

The whole of statistical methods depend upon probability theory. 

15.2.1 Definitions 

An experiment or trial is the establishing of certain conditions which 
may produce one of several results or cases. 

Example (i) toss a coin, (ii) toss a coin twice, (iii) toss two coins, 
(iv) measure the diameter of a ball-bearing. 

An event is a possible outcome of an experiment. 

A random variably is a variable which specifies the result of an experi¬ 
ment. For example in the coin-tossing experiment, the random variable 
x may be defined as x = 0 if a head is thrown, x = 1 if a tail is thrown. 

Mutually exclusive events: Two events E x and E 2 are said to be mutually 
exclusive if E 2 precludes E x . More generally, E x , E 2 , E 3 , ...,£„ are 
mutually exclusive if all pairs (E t , E t ), i ^ j are mutually exclusive. 




396 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 
For example in the experiment of tossing two coins, 

(i) let Ex be the event H, T, and let E 2 be the event H, H, then £) and 
E 2 are mutually exclusive. 

(ii) let Ex be the event H and T, i.e. HT and TH. Let E 2 be the event H, 
then Ex and E 2 are not mutually exclusive. 

The event either Ex or E 2 is the event that either Ex or E 2 or both, 
happens. 

The event both Ex and E 2 is the event that both Ex and E 2 happen. 
For example, if the experiment is to throw a six-sided die, and Ex 
the event that the score is odd, E 2 the event that the score is a multiple 
of 3. Then the event (either Ex or E 2 ) is the event that the score is 
1, 3, 5 or 6. The event (both Ex and E 2 ) is the event that the score is 3. 

Mutually exhaustive events: The events E x and E 2 are said to be mutually 
exhaustive if either Ex or E 2 must happen. For example, in the experi¬ 
ment of rolling a single die, Ex, the event that the score is even, and E 2 
the event that the score is odd, are mutually exhaustive. More generally, 

Ex, E 2 , E 3 . E n is a mutually exhaustive set if one of the events 

Ex, E 2 , E 3 , ... or E n must happen. 

An elementary event is an event which cannot be decomposed into 
simpler events. For example the event E that the score is 5 on rolling 
a die is an elementary event. The event E that the score is a multiple of 
3 can be decomposed into the elementary events E lt that the score is 3 
and E 2 , that the score is 6. 

15.2.2 Empirical definition of probability 

It is not usually possible to say that the outcome of an experiment will 
be E, but it may be possible to assign a probability to E. Suppose the 
event E occurs m times in n repetitions of an experiment. Then if as 
« —► oo the ratio mjn -> a limit p, this limit is called the probability of 
the event E. This is the empirical or statistical definition of probability. 

Example (i) In the experiment of tossing a single coin, let r be the 
number of heads thrown, and n be the total number of throws. Then 
the probability of tossing a head with this coin is 

Lim [(number of heads)/(total number of throws)] 
which may be written in symbols as 

P{H} = Lim (r/n). 

Example (ii) In the experiment on wheat yields of section 15.1, 
F{wheat yield is between 0-28 and 0-31 kg/m 2 
= Lim [(number of plots with this yield)/(total number of plots)]. 
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Properties 

If E is certain, m = n and P{E} = 1 (1) 

If E is impossible, m — 0 and P{E} = 0 (2) 

Since 0 < m < n, 0 < 1. (3) 

15.2.3 Addition theorem 

If Ei and E 2 are mutually exclusive events 

P {either E x or E 2 ) = P{E X ) + P{E 2 ). (4) 


Let an experiment be performed n times, and let E x be obtained n x 
times and E 2 be obtained n 2 times. Then (either E x or E 2 ) has been 
obtained ( n x + n 2 ) times. Hence 

^{either E x or E a } = Lim [(/ij + n 2 )/n] 

= Lim [njn + n 2 ln\ 

= Lim (njn) + Lim (njn) 

= P{E X ) + P{E 2 ). 

This result is extended as follows. If E x , E 2 , E 3 ,. . ., E„ are n mutually 
exclusive events, 

P{E X or E 2 or E 3 or . . . or E n } 

= P{E X ) + P{E 2 ) + P{E 3 ) + . . . + P{E n ) (5) 

Corollary 1 If E x , E 2 , E 3 . E n are also mutually exhaustive then 

P{E X ) + P{E 2 ) + P{E 3 } + . . . + P{E n ) = 1 (6) 

Proof The event (E x or E 2 or E 3 or . . . or E n ) must happen so that 
P{E X or E 2 or E 3 or . . . or E n } = 1 
and combining this with equation (5) gives equation (6). 

Corollary 2 If an experiment must result in one of n mutually exclusive 
events, i.e. they are mutually exclusive and exhaustive, which are all 
equally likely, then 

P{E X } = P{E 2 } = P{E 3 } = . . . = P{E n } = l/(total number of events) 
= 1 In. (7) 

Proof Since the n events are exhaustive, from equation (6) 

P{E X } + P{E 2 ) + P{E 3 ) + . . . + P{E n ) = 1, 
and since they are all equally likely, 

P{E X ) = P{E 2 ) = P{E 3 } = . . . = P{E n }. 

Hence the result (7). 

Example If x is the score on rolling a die, then 

P{x = 1} = P{x = 2} = P{x = 3} = . . . = P{x = 6} = i 
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Corollary 3 Let the result of an experiment be one of the finite number 
of mutually exhaustive and elementary events E u E 2 , E 3 , . . ., E n , and 
let all these events be equally likely. Let E be the event defined by the 
occurrence of m of these elementary events. Then 

P{E} = min , (8) 

since P{E} = P {any m of the events E u . . ., E n } 

= sum of probabilities of these m events 
= m/n. 

_ number of outcomes favourable to E 

Thus P{E} = -— — ; - r - p — t - (9) 

total number of outcomes 

and this is the result used in calculating probabilities. 

Example (i) If a single coin is tossed, the probability of getting a head, 
i.e. P{H) = i. 

Example (ii) In one throw of a die, 

P {score is 5} = |; 

P {score is even} = 3/6 = 

Example (iii) Find the probability that the score is 8 if two like dice 
are thrown. 

Solution A total score of 8 may be obtained from the pairs (2, 6), (3, 5), 
(4, 4), (5, 3), (6, 2). The total number of possible events is 36 and there 
are 5 outcomes favourable to the event that the score is 8. 
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Table 15.1 shows the representation of each outcome by the pair 
(r, 5), r being the score on the first die, s the score on the second. 

Hence, P {total score is 8} = 5/36. 

Example (iv) What is the probability of selecting a red ball in one 
draw from a box containing 5 red and 3 black balls? 

Solution The total possible number of outcomes is 8, and of these the 
number favourable to the event, drawing a red ball, is 5. Hence, 

P {ball selected at random is red} = f. 


15.2.4 Use of permutations and combinations 

The number of permutations of n unlike objects taken r at a time is the 
number of ordered arrangements of n objects taken r at a time. This is 
n{n — 1)(« — 2). . . (n — r + 1), which is written 
n P T = n\/(n - r)!, 

where by definition, n\ = n(rt — 1 )(n — 2). . .3.2.1 and 0! = 1. 

Example (i) The number of different arrangements of the letters A, B, 
C, D is 4! = 24. 


Example (ii) The number of permutations of 2 letters from the 4 letters 
A, B, C, D is 

*P 2 = 4!/2! = 12. 


The number of combinations of n different objects taken r at a time 
is the number of ways of selecting r objects from the n different objects, 
without regard to order. The number of arrangements is n P T and each 
arrangement can be permuted in r! ways. Hence the number of selec¬ 
tions is 

"i> r /(r!) = n!/(/■!)(« - r)\ 
which is written n C r or • 

The number of ways of dividing n different objects into three groups, 
the first containing n x , the second n 2 , and the third n 3 objects, where 
”1 + n 2 + n 3 = n, is «!/(«i! n 2 \ « 3 !). This is proved as follows. The 

first group is selected in j ways, and once this is fixed there are 
(n — «i) objects from which to select n 2 , and this can be done in 
(” n Hl J ways. The third group is 
of ways of dividing the n objects is 

'-”‘'1 x .« ■ 


v fixed. Hence the total number 


(« ~ "1)! 


w 


n^.{n — /ii)! n 2 \(n — n x — n 2 )\ 
n\ 

—j—:—: since n — n x — n 2 = 
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Extending this argument, the number of ways of dividing n objects into 
k groups, the first containing n u the second n 2 and so on, objects, so 
that «! + n 2 + « 3 + . . . + n k = n is 

nl 

n x \ n 2 \ n 3 !. . . n k \ 

Example (i) A box contains 5 red and 3 blue balls. Find the probability 
of the event E that two balls selected at random are both red. 

Solution The total number of possible outcomes is the number of ways 
of selecting 2 from 8, which is Q j. The number of outcomes favourable 
to E is the number of ways of selecting 2 red from 5 red balls which is 

(2)' Thus, 

r{£} — (2) / (2) =5/14. 


Example (ii) A box contains 2 red, 4 blue, and 5 white balls. Find the 
probability of the event E, that 4 balls selected at random contain 
exactly 2 white balls. 


Solution The total number of possible outcomes is the number of ways 
of selecting 4 from 11, which is ^ j. The number of outcomes favour¬ 
able to the event E is the number of ways of selecting exactly 2 white 
from 5 white and 2 others from the remaining 6 others. This is ^ ' 

Hence 

W0, 




15.2.5 Use of the complementary event 

The events E and E are complementary events if E occurs whenever E 
does not. Thus E and E are both mutually exclusive and exhaustive. 
Hence 

P{E) = 1 - P{E}. (10) 

If P{Ej = p and P{E) = q, then 

P + q= 1 or q = l — p. (11) 

Example (i) A box contains 2 red, 4 blue, and 5 white balls. Find the 
probability that 4 balls selected at random contain at least one white 
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Solution Let E be the event that at least one of the four selected is 
white, then E is the event that none of the four is white. Now 



and hence P{E} = 21/22. 

Example (ii) In an experiment in which a coin is spun 12 times, find 
the probability that at least 2 heads will be thrown. 

Solution Let E be the event of obtaining at least 2 heads. Then the 
probability of a head (H) in one throw is The number of ways of 

obtaining exactly 2 heads and 10 tails is / . J, and the probability of 

***(?)(»•(!)' 

There are two methods of obtaining P{E}. 

Method ( a ) The probability of the event E is the probability of obtain¬ 
ing 2, 3, 4, . . ., 12 heads. Hence 
P{E } = P{2H or 3H or . . . or 12 H} 


= P{2H} + P{3H} + . . . + P{12H} 



Method ( b ) E is the event of obtaining at least 2 heads, so that the 
complementary event E is the event of obtaining no head or one head. 



and P{E } which is 1 - P{£} is given by 

P{E}= 1-13(^) 12 - 
15.2.6 Conditional probability 

The probability that the event E 2 will occur knowing that the event E x 
has already occurred, or is certain to occur, is called the conditional 
probability of E 2 , given E x . This is written P{E 2 \E X ). 

Suppose E x and E 2 are outcomes of an experiment which is repeated 
n times. Let E x occur n x times, and let the number of times both E x 
and E 2 occur be n 2 . Then the probability of E 2 given that E x is certain 
= Lim (n 2 /n x ) 

-H&) 

— Lim (« 2 /«)/Lim (njri) 

= P{both E x and E 2 }/P{E X }. 
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Hence the Multiplication theorem follows in the form 

P{both E x and E 2 } = P{E X }P{E 2 \E X }. (12) 

Two events E x and E 2 are said to be independent if 

P{both E x and E 2 } = P{E X }P{E 2 } (13) 

i.e. if P{E 2 \E X } = P{E 2 }. 

The definition of independent events is extended to the k events 
E lt E 2 , E 3 , . . ., E k , which are independent if 

P{E X and E 2 and . . . and E k } = P{E X }P{E 2 } . . . P{E k } (14) 

Example (i) Two coins are tossed. What is the probability that the first 
coin shows a tail and the second a head? 

Solution Let E x be the event tail on the first coin and E 2 be the event 
head on the second coin. Then since all possible outcomes may be 
written 

HH, HT, TH, TT, 

the required probability 

•P{both E x and E 2 ) = £ = \ x \ = P{E X }P{E 2 ). 

Hence E x and E 2 are independent events. 

Example (ii) Two dice are rolled. What is the probability that the total 
score is eight if it is known that the dice show at least three points each ? 

Solution Let E x be the event that the dice show at least 3 points each, 
E 2 the event that the total score is 8, then, using Table 15.1, 

P{E 2 \E x } = P{both E x and E 2 }!P{E X } 

3/36 3 

“ 16/36 " 16 

Example (iii) Three dice are rolled. If no two dice show the same face, 
what is the probability that at least one face shows a six? 

Solution Let E x be the event that no two faces show the same score, 
E 2 the event that at least one face shows a six, then E 2 is the event that 
no face shows a six. Then 

P{E 2 \E x } = 1 - P{E 2 \E x }. 

Since there are 3! ways of arranging sets of three, 

P{E X }= (3) (4) 3 3! = (6.5.4)/6 3 ; 

or alternatively, P{E X } = P{lst is any number and 

2nd is a different number and 
3rd is different from 1st two} 

_6 5 4 
~ 6 6 6 ' 
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_ 543 5.4.3 

Also, P{both E 2 and Ej} = g. ^ ^ == —^ 5 — 

Hence, P{£ 2 {EJ = P{both £ 2 and EJ/PIEJ 

= (5.4.3)/(6.5.4) = i 
Hence P{E 2 \E 1 } = 


15.3 RANDOM VARIABLES 

Consider an experiment A which may have several outcomes. It is 
convenient to introduce a variable x to correspond with each event of 
A. For example if a coin is spun the random variable may be defined by 
x = 0 if the event is heads, x = 1 if the event is tails. If two coins are 
spun and the number of heads (H) counted, the random variable may 
be defined as x = 0 if no H, x = I if 1 H (HT or TH), and x = 2 if 
2 H. In the experiment of wheat yields of section 15.1, x may be defined 
as the yield in kg/m 2 , in the experiment of measuring the diameter 
of ball-bearings x may be defined as the diameter in mm. The random 
variable x is defined as a number associated with each event of an 
experiment, random since its value depends on the outcome of an 
experiment and hence on chance. 


15.3.1 Discrete random variables 


A discrete random variable is a random variable that can take only a 
fini te number of distinct values which can be arranged in a definite 
order. 

Let the random variable x take values x u x 2 , x 3 , . . ., x n corre¬ 
sponding to all the n events of an experiment. 


Let Pi = P{x = xj] 

p 2 = P{x = x 2 } 
p 3 = P{x = x 3 } 


(15) 


p n = P{x = x n } J 

Then the set of numbers p t , i =*- 1, 2, 3,. . ., n, defines a frequency 
function f(x) which gives the probability that the random variable x will 
assume any value in its range. A frequency function is often given as a 
table of values. Thus, if a single die is rolled and x is defined as the 
score, the table of values is given by Table 15.2 

x I 1 2 3 4 5 6 

f ( x ) I 1/6 1/6 1/6 1/6 1/6 1/6 


Table 15.2 
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It is sometimes convenient to graph the frequency function f(x ) by a 
line graph, as in Fig. 15.1. 



If two dice are rolled, let x be the total score, then Table 15.3 is obtained 
from Table 15.1. 

x I 2 3 4 5 6 7 8 9 10 11 12 

/(*) | 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36 

Table 15.3 

and the line graph is shown in Fig. 15.2. 



Since /(x,) = P{x = x,} = p„ i = 1, 2, 3,. . ., n, 
and there are exactly n events associated with the experiment, 
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The frequency function determines the distribution of any event 
associated with the experiment. For example 

P{x r < x < x s } = P{x = x r } + P{x = x r+1 } + . . . + P{x = x s } 

= /(*r) +M + 1 ) + • - - +/(*,) (17) 

An important function associated with the frequency function is the 
distribution function F{x), sometimes called the cumulative distribution 
function, which is defined by 

F(x)= 2/(0, (18) 

i<x 

where the summation is taken over all those values of x less than or 
equal to a given value. Hence 

p(x k ) = 2 m 

=/(*i) +/(X 2 ) +/(x 3 ) + . . . +f(x k ), 
or F(x k ) = pi + p 2 + p 3 + ■ ■ ■ +Pk> (19) 

i.e. F(x k ) gives the probability that the random variable x is less than 
or equal to the given value x k , whereas f(pc k ) is the probability that the 
random variable x is equal to x k . Hence 

P{a<x<b} = F(b) - F(a). 

The graph of F(x) for the frequency function of Table 15.3 is shown in 
Fig. 15.3. 



Fig. 15.3 


15.3.2 Continuous random variables 

If a random variable is capable of taking any value in a given interval 
or intervals it is called a continuous random variable. Thus the random 
variable x defined as the height of a man from the population of the 
heights of all men in a given place is a continuous random variable, 
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since it can take any value in an interval say from 1-5 metre to 1-8 
metre. Variables which involve measurement such as height, length, 
temperature, and weight are continuous variables. 

As an example consider the set of numbers x representing the 
diameter in mm of 200 ball-bearings, the diameters being measured to 
three places of decimals. Values of x are thus regarded as values of a 
continuous variable whose values have been rounded off to three 
decimal places, so that a measurement of 0-155 mm means that the 
actual measurement is between 0-1545 mm and 0-1555 mm. The fre¬ 
quency or number of times the value x t occurs is denoted by/ ( , so that 
2/i = 200 and the resulting frequency distribution is shown in table 
form in Table 15.4. 

x I 0-115 0-125 0-135 0-145 0-155 0-165 0-175 0-185 0 195 0-205 0-215 

/ I 6 9 11 21 31 39 32 25 12 10 4 

Table 15.4 

The frequency here denotes the absolute frequency with which x occurs, 
and this table is the type of result usually obtained in an observed 
frequency distribution. The word frequency more often denotes the 
relative frequency, which is the ratio of absolute frequency to the total 
number of observations, since relative frequency approaches probability 
as the number of observations increases indefinitely. 



Fig. 15.4 

The results of Table 15.4 are represented graphically by the histogram 
of Fig. 15.4 in which areas represent absolute frequencies: for example, 
the area of the rectangle the centre of whose base is 0-115 represents 
frequency 6. The total area of the histogram is thus 200. The length of 
the interval on the x-axis, here given by 0-01, is usually denoted by h 
and called the class-interval. 
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If the measurement of 200 ball-bearings is considered as 200 runs of 
an experiment, a further 200 runs would result in a histogram twice the 
area of that in Fig. 15.4. In order to compare histograms based on 
different numbers of experiments, it is more convenient to choose the 
scale on the ordinate-axis to represent relative frequency fjn so that 
the total area is 1. With this choice of unit the histogram approaches a 
fixed histogram as n oo. Also if the measurements x can be made 
as accurately ‘as we please’, the class interval h can be chosen as ‘small 
as we please’ so that the histogram approaches a smooth curve as 
A —► 0 and n -*■ co. If this curve can be represented by f(x), this is the 
frequency function of the continuous random variable x. For a con¬ 
tinuous random variable the frequency function is sometimes called the 
probability density function. 

The frequency function for the continuous random variable x is 
defined as the function f(x) which has the following properties: 

(0 m> 0; (20) 

(compare with p { > 0 all i). 

(ii) j" > J(x)dx = U (21) 

(compare with 2/h = 1). 

(iii) JV(*) dx = P{a < x < b} (22) 

where a, b are any two values of x; (compare with P{x r < x < x,} 

= imy 

The distribution function F(x) is defined by F(x) = 2 /(/)> the 
summation now being the limit of a sum and hence an integral, or 



F(x) = J J{t)dt 

(23) 

and 

fix) = F'(x). 

(24) 


The value of statistical methods is in finding mathematical models to 
fit as closely as possible the observed or empirical data. There are 
standard frequency functions which are used most often as models, and 
some of these are demonstrated in the following examples. 

Example (i) Consider the possibility of using 


/(*) = 


\0 


x >0 


x < 0 


as a frequency function for x, where A is a positive constant. 
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Solution Checking the properties (20) and (21) gives 

(a) 2c ~ Al > 0 all x; 

(b) j f(x ) dx =* J 2e _ Ax dx 



= 1 for all 2. 


The distribution function F{x) is given by 


F(x)= [ J(f)dt 



Fig- 15.5 Fig. 15.6 

The graphs of fix) and F(x) are shown in Figs. 15.5 and 15.6, and 
P{a<x<b) = JV) dx 

= F{b) - Fid) 

= 1 — e~ Xb — 1 + e ~ Ka 
= e~ Xa — e _ A6 . 

Although the choice of fix) as a negative exponential appears to be 
arbitrary, it is found that this function can be used as a good approxi¬ 
mation to many observed frequency distributions. 

Example (ii) A uniform, or rectangular distribution, is of use when the 
continuous random variable x is equally likely to take all values between 
two given values a and b. Consider the possibility of using fix) = c, 
where c is a constant, as the frequency function for x. 




STATISTICS 


409 


Solution Since, from equation (21), J fix) dx = 1, then c = 1 /(b — a). 
Hence the frequency function becomes 

/W -(th tora<x<t, 

1.0 for all other values of x. 

The distribution function F(x) is given by 

F(x) = J At) dt = (x — a)j{b - a), 
and the functions f(x), F(x) are sketched in Figs. 15.7 and 15.8. 



O 1 a b x O 1 a 6 

Fig. 15.7 Fig. 15.8 


Example (iii) The life x, in hours, of a certain type of electronic valve 
has the frequency function 

fix) = 100/x 2 , x > 100; 
fix) = 0, x < 100. 

(a) Find the probability that the life of a valve is less than 150 hours. 

(b) If the valve is still working after 150 hours, find the probability that 
it will have to be replaced before 250 hours life. 

Solution 

(a) P{x < 150} = fV) dx 

« f 1S0 . 

= 0 + L^ dx 

=i 

(b) Let E x be the event x > 150 and let E 2 be the event x < 250. Then 

PiEzlEy) = P{E 2 and E 1 }/P{E 1 } 

= P{E 2 and EJ /[1 - P{EJ]. 

Now, from (a), P{E{) — i- 

Also P{E 2 and E x } = J ^ dx = 

Hence P{E a |£ x } = f, and this is the probability that a valve, if it lasts 
more than 150 hours, will cease to function before 250 hours of its life. 
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15.4 REDUCTION OF DATA FOR AN OBSERVED 
FREQUENCY DISTRIBUTION 

A mass of raw data needs to be ordered if it is to be easily read and 
understood. Measures of location and dispersion are used to describe 
the distribution in few and simple parameters. 

15.4.1 Measures of location 

(a) The mean or arithmetic average is the most important measure of 
location. For a discrete random variable, suppose there are f t values 
of x — x t , i = 1, 2, 3,. . ., k, then the mean m or x is defined by 

(2/t *<) j «. where2/ t = n. 

Example Find the mean of the distribution given by the following 
table: 

x 

/ 

Solution 
x or m 
= (0 X 14 + 

= 2-43. 

For a continuous random variable, as in Table 15.4 of section 15.3.2 
it is necessary to assume that approximately all the observations in a 
given class interval have the value at the mid-point of that interval. If 
x t is the mid-point of the ith interval, and /, is the corresponding class 
frequency, then as before: 

x or m = (2/, x,)/n. (25) 

The mean of the distribution of Table 15.4 is 0 165 3. 

(b) Suppose there are n observations which can be arranged in order, 
then the median is the value of the middle observation. If n = 2k + 1 
(odd) the median is the value of the (k + l)th observation. If n = 2k 
(even) the median is the value of the average of the ki\t and (k + l)th 
observations. The median is of use for example, in describing the wage 
level in a certain industry where a few very high wages affect the mean 
but not the median. The median wage is such that half the employees 
receive at least this much. 

(c) The mode is the observation which has maximum frequency. For 
example for the set of measurements 4, 4, 5, 5, 5, 6, 7, 7, 8 the mode 
is 5. A single mode gives a rough and quick measure of location but 
is not very useful, particularly if the histogram has more than one peak. 


10 1 2 3 4 5 

I 14 15 23 22 14 12. 

2/, = 100 = n, 

Ixl5 + 2x 23 + 3x 22 + 4xl4 + 5x 12)/100 
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15.4.2 Measures of dispersion 

These are measures of the spread of the observations. 

(a) The range is the difference between the largest and smallest measure¬ 
ments in a set of measurements. It is used in quality control. 

(b) The mean deviation is the average of the deviations of each observa¬ 
tion from the mean, all deviations being taken as positive, i.e. 

mean deviation = “ 2 l*i — m \ 

or 

(c) The variance and standard deviation, which is the square root of the 
variance, are the most important measures of dispersion. Variance, 
denoted by s 2 , is defined by 

■ y2 = ^ 2 ( x * “ m ) 2 fi (26) 

and the standard deviation s is the square root of this. 

(d) The interquartile range is defined as Q 3 — Q u where Q x is that 
value such that £ of the observations are less than Q u Q 2 is the median, 
Q 3 is such that £ of the observations are less than Q 3 . 

15.5 MEASURES OF LOCATION AND DISPERSION 
FOR A PROBABILITY DISTRIBUTION 

15.5.1 The mean 

(a) First, consider the mean of a probability distribution for a discrete 
random variable. Let an experiment be repeated n times, and let there 
be /, observations x h i = 1, 2, 3,. . ., k. Then the mean of the n 
observations is m, and using the definition of section 15.4.1, 



Now as n -*■ co, f/n -*■ p K where p t = P{x = x f }, and hence m -» £ p t x ( . 
Hence the mean of a probability distribution for a discrete random 
variable is defined as 

fi = 2p t x,. (27) 

(b) Second, consider the mean of a probability distribution for a con¬ 
tinuous random variable. Let an experiment be repeated n times, and 
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let the observations be grouped into classes of width h — Ax. Let /, 
be the number of observations in the ith interval and x, be the mid¬ 
point of that interval. Then m, the mean of the grouped distribution is 
given by 



Now, as n ~> co, the value of/,/n approaches the probability that x lies 
in the ith interval, i.e. /,/n ->/(x,) Ax, where fix) is the frequency 
function. Hence, as n -*■ oo, m -*■ ^{/(x,) Ax}x,. If also Ax ->■ 0, then 
m -> jx/(x) dx. Hence the mean for a continuous random variable is 
defined by 

H =Jx/(x) Jx, (28) 

the integration being over the range of values of x for which /(x) is 
defined. 

The mean for any distribution is thus defined by the first moment of 
the distribution. Compare with formula (12) of section 7.2.3. 

15.5.2 Dispersion, variance 

(a) First consider the variance of a probability distribution for a 
discrete random variable. For n observations x,, where x, occurs /, 
times, i = 1,2, 3,. . ., k, 

s * = hiffc * - m ) 2 
= ? (■») <*' - 

As n ->■ t», the value of (/,)/« -+p { = P{x = x,}. Hence the variance 
of the probability distribution is defined by 

(ifi. (29) 

(b) Second, consider the variance of a probability distribution for a 
continuous random variable. For n observations the variance of the 
grouped distribution is 
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As n -»• oo, the value (fd/n -* f(x t ) Ax, and if at the same time Ax -* 0, 
the variance of a continuous probability distribution is defined by 

° 2 = /(* ~ PWW dx ' (3°) 

The integration is taken over the range of values of x for which/(x) 
is defined. 

The variance of a distribution is defined by the second moment of the 
distribution. Compare with formula (20) of section 7.2.7. 

Example (i) Let * be the score recorded when a die is rolled. Find the 
mean and variance for the probability distribution. 

Solution The variable x is discrete and the following table gives the 
(rectangular) probability distribution. 

* I 1 2 3 4 5 6 

p I t * \ i i t 

The mean p and variance a 2 are given by 

p = T,p t xt = 1(1 + 2 + 3 + . . . + 6) = 3-5, 
o 2 = 2 Pl (x, - pf 

= K1 - 3-5) 2 + 4(2 - 3-5) 2 + . . . + |(6 - 3-5) 2 
= 35/12. 

Example (ii) Find the mean and variance for the rectangular distribu¬ 
tion given by 

«*)-(" 5 * -*<*<*> 

(=0, when x is not in the interval (—a, a). 

Solution Using formulae (28) and (30), the mean p and variance a 2 are 
given by 

= J_ o xf(x) dx = j l(x/a) dx = 0, 
o 2 = j\x-0) 2 f(x)dx^i a2 . 

Example (iii) Find the mean, variance, and median for the negative 
exponential frequency function f(x), defined by 

(Xe~ Xz , x > 0 
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Solution The mean p, is given by 

H = J x/(x) dx = |* xle~ Xx dx = 1/A, 

on integrating by parts. 

The variance is or 2 , where 

a 2 = JT°(x- 1 IXYte-^dx 

= J x 2 Ae“ Xx dx — j J Axe _iU £/x + | J e~ Xx dx 

2_2 11 
“ A 2 A • A + A 2 

_ 2 

“ A 2 ' 

The median x m is given by 

J Ae - Xx dx = i, 

since half the population lies each side of the median. Hence 



Solving, x m = (log 2)1 k. 

15.6 PRACTICAL COMPUTATION OF MEAN AND 
VARIANCE 

It is possible to simplify the arithmetic involved in computing the mean 
and variance by coding the data. There are two processes of coding 
used, (a) changing the origin, or using a ‘working zero’ and (b) changing 
the scale, or unit. Changing the origin involves writing y = x — x 0 
and changing the scale involves writing u == y/c, where the constant x 0 
is the new origin, or working zero, and c is any convenient constant. 

15.6.1 Computation of the mean 

Writing y t = x t — x 0 , u t = yjc, means that x t = o/, + x 0 for each /, 
and hence, 

x = cu + x„. (31) 

Example Find the mean of the three numbers 
x = 0 169, 0 162, 0170. 
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Solution Write y = x — 0 160 = 0 009, 0-002, 0-010, and u = 1 OOOy 
= 9, 2, 10. In working units the mean is $(9 + 2 + 10) or 7. Partly 
decoded it is 0-007, and fully decoded the mean is 0-167. Alternatively, 
u = 7, c — 1/1 000, x 0 = 0-160, so that 

x = 0 007 + 0-160 = 0-167. 

15.6.2 Computation of variance 

(a) Changing the origin to x 0 by writing y = x — x 0 gives the result 

si = si (32) 

i.e. coding by changing the origin has no effect on the variance. This 
may be proved as follows: 

= \ 2/i(*i -x 0 -x + x 0 ) a 
=;; 2/<(*< - *) a 

(b) Changing the scale so that u = y/c gives the result 

si = sl/cl (33) 

or, variance of u = (variance of y)/c a . 

This is proved as follows: 

4 = ~ «) a 

A 9 

i _ 

~ c 2 Sy ‘ 

Hence s% = sj = c 2 j*. (34) 

From the example of section 15.6.1, 

u = 1 000y = 9, 2, 10, 
u = 7, 

si = M2 2 + 5 2 + 3 2 ) = 38/3. 
s% = 38/(3 x 10 8 ). 


Hence 
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15.6.3 Modified form of formula for variance 

Because the mean x is not usually a simple number, and even the coded 
mean u may not be simple, the calculation of variance is often simplified 
by using the identity 


E/^m, - uf = S/ <M f nu\ 

This identity is not difficult to prove, for if 2/, = n, 

E fi(u t — ti) 2 = E f(u 2 — 2ufi + u 2 ) 

= S f t u 2 t - 2 u S f Ui + u 2 2/,. 

Now (E fiU t )jn = u, and hence 

E ffyt, - uf = S/iU 2 - 2 u(nu) + u 2 n 
= E/(«? - nu 2 . 

Thus the modified form of the variance s 2 is given by 

^ = \ i flu, - uf = (J | /,«?) - « 2 . (35) 

Alternatively, equation (35) may be written 

(36) 

Example Find the mean and standard deviation for the following 
distribution. 

x I 9-95 9-96 9-97 9 98 9-99 1000 1001 1002 10 03 10 04 10 05 
/ I 2 4 5 8 10 16 15 17 11 7 5 

Solution The work is set out in table form as follows, with 

= 100(at ( - 10 00). 


/_“_ u f u 2 f — {uf)u 


9-95 2 

9-96 4 

9-97 5 

9-98 8 

9-99 10 

10 00 16 

1001 15 

10-02 17 

10 03 11 

10-04 7 

10-05 5 


-5 -10 50 

-4 -16 64 

-3 -15 45 

-2 -16 32 

-1 -10 10 

0 0 0 

1 15 15 

2 34 68 

3 33 99 

4 28 112 

5 25 125 


Totals 


100 


68 


620 
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The calculation proceeds as follows: 
it = 68/100 = 0-68, hence the mean x is given by 

x = 0 006 8 + 10 00 = 10 006 8. 

Also si = 620/100 - (0-68) 2 = 6-20 - 0-462 4 — 5-738. 

Hence si = 0-000 573 8, and the standard deviation is given by 

■r* = 0-023 96 0-024. 

15.7 THE BINOMIAL DISTRIBUTION 

There are three standard theoretical frequency distributions to which 
practical, or observed, distributions may approximate closely. The first 
of these is the binomial distribution. 

Suppose the result of a trial is only success or failure, i.e. a particular 
event occurs or does not occur. Let the probability of success be p, then 
q — 1 — p is the probability of failure, since success and failure are 
independent and exhaustive events. Let there be n trials and let r be 
the number of successes recorded; and further, suppose that successive 
trials are independent. The problem is to find the frequency distribution 
of r, which can take only integer values. 

The probability of obtaining r successes and (n - r) failures in a 
specified order, say the first r trials result in success, is 

r ( n-r ) 

P-P-P-. . .p.q.q.q.. . .q 
which equals p T q n ~ r . 

The number of different ways in which r successes can occur in n trials 
is ^ r j- Hence the probability of r successes in any order is p T q n ~ T . 

This expression is the (r + l)th term in the expansion of the binomial 
expression (q + p) n and is called the binomialfrequency function f(r), i.e. 

/O') = (”) p r q n ~ T - (37) 

The symbol r is used to emphasize that r is a discrete variable which 
can take integer values only. Hence 


P{r = 0} = ( ( 

o)/>V = 

q n . 

II 

i 

= npq n ~\ 

P{r = 2} = (! 

jW- s 

= — 1 
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and so on. The terms on the right are recognized as successive terms 
of the binomial expansion of ( q + p) n , since 

0 q + pY = q n + nq n ~ x p + \n(n - 1 )q n ' 2 p 2 + . . . + p n (38) 




pY r 

= 2m 

Also, since q + p = 1, then 

2/(0 = 1. 

15.7.1 Mean and variance of the binomial distribution 

Mean of r = p — 2 r f( r ) 

r=0 

i r.nl 
” £ 1 rl(n-r)\ Pq 


(39) 


A (r - !)!(» “ 0- 


— (r — 1YM ^ 9 


- £ (r - 1)!{(« - 1) - (r - l)}!' 
and writing r — 1 = s, 

(«- 1)1 

mean of r = np 2 sK „_ l _ s)l PY n 

— np{q +/>)“" 1 
= np since q + p — 1. 

Hence, mean of r = np. (40) 

Using similar algebraic manipulations, 

variance of r = a 2 = 2 r V( r ) ~ /“ 2 > (41) 

and writing r 2 = r(r — 1) + r in the first expression on the right, 

1 r 2 f(r ) = I Hr - 1 ) + r} (") P Y~ r 

= 2 r ( r ~ 0 (”) PY~ r + / r pY~ r 

= | , '('' - 1) r ) \ prqn ~ T + nf (fr ° m(40)) 

(« ~ 2)! 




= 2 ( r — 2)!{(» — 2) — (r — 2)}! 


«(«- i)/>y- 2 
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Writing r — 2 = s in the summation, 
n n _ 2)* 

4To r = s ^ j!(« — 2 — s)\P’q n 2 ’ n ^ n ~ ^P 2 n P 

= n(n — 1 )p\q + p) n ~ 2 + np 
= n(n - l)p 2 + np. 

Finally, substituting in equation (41), 

° 2 = «(« - 1 )p 2 + np — n 2 p 2 
= «p(l -p) 


Hence, for the binomial distribution, 

mean fi = np ( 42 ) 

variance a 2 = npq ( 43 ) 

Example (i) A machine makes certain products which are known to be 
on average 1 per cent defective. What is the probability that a box of 
one hundred of these products contains (a) exactly 3 defectives, (b) at 
least 3 defectives? 

Solution Let r be the number of defectives in a box of 100, then r has 
a binomial distribution with n = 100, p = 0-01, q ~ 0 99. 

(a) P{r = 3} =/(3) 

= (0-01) 3 (0*99) 97 

= 0 060 5 

O’) P{r > 3} = P{r = 3 or r = 4 or r = 5 or . . .or r ~ 100} 

= 1 - P{r = 0 or 1 or 2} 

= i -/(o) -m -/(2) 

= 1 “ (T) (0 ' 99)1O ° “ (T) (°- 01 X0-99) 99 
/100\ 

- ( 2 ) (0 01) 2 (0-99) 98 

= 1 - 0-366 44 - 0-370 13 - 0-185 065 
= 0-078 36 
— 0-078 4. 

In the calculation /(0) = (0-99) 100 was calculated using five-figure 
logarithms,/(l) was then calculated as 100/(0)/99 and/(2) as iff 1), i.e. 
each term of the binomial distribution was calculated from the previous 
term. 

The mean number of defectives in a box of 100 is np = 1 and the 
variance is npq — 0-99. 
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Example (ii) If the probability that a target will be hit on a single 
bombing run by an aircraft is 01, find the probability that on 20 such 
runs the target will be hit at least twice. 

Solution In this example, p = 0-1, q = 0-9, and n = 20, so that 
P{r > 2} = 1 - P{r < 2} 

= i -/(o) -m 

= 1 - ( 2 0 °) (0T)°(0-9) 20 - ( 20 ) (0-1)(0-9) 19 
= 1 - 0-121 56 - 0-270 13 
^ 0-608 3. 


15.8 THE POISSON DISTRIBUTION 


Suppose r has a binomial distribution f(r) = p r q n ~ r and that the 

number of trials n is very large, then the computation of f(r) becomes 
very lengthy. There are two standard frequency functions which give a 
good approximation to the binomial distribution when n is large. The 
first is the Poisson distribution in which p is also very small so that np 
remains finite. 

To derive the Poisson frequency function, consider n -*■ oo and 
p -*■ 0 in such a way that np = p is a finite constant. Expand the 

binomial coefficient ^ and multiply numerator and denominator by 
n T to give 


r factors 


fir) 


1 n(M - !)(» - 2) . . . (If - r + 1) 

---- {npU 


Since q = 1 — p, np — p, and q = l — p/n, this expression becomes, 
on substituting, 

It can be proved that 




so that, as n -*■ oo and p -*• 0, the value of f(r) approaches (p r e~“)/r\. 
Hence the Poisson distribution is given by 
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15.8.1 Mean and variance of the Poisson distribution 

Using definition (27), with x = r;p =/(r), then 


nof r = 2r/(r)= f 


r 0 r\ 

Writing r — 1 = j in this summation gives 

mean of r = e~ “ u Y 
s=o 

= e“ "jU e“ 

= ft. 

Variance of r = <r 2 = f r 2 /(r) - /< 2 ( 45 ) 

and 

Set r 2 = r(r - 1) + r, so that 

l o rV(r) = 2 r(r - 1) e- £ + f r e- £ 

Writing r - 2 = j in the first summation and r - 1 = / in the second 
summation, 

J Q r>f(r) = e-> 2 | o g + e -> f £ 

= e'* p 2 e“ + 

= A* 2 + /*• 

Substituting in equation (45) gives 

variance of r = p. 

Hence for the Poisson distribution, 

mean = variance = p. 


( 46 ) 
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15.8.2 The Poisson distribution as an approximation to the binomial 
distribution 

The number of happenings of a rare event in a large number of trials 
approximates to a Poisson distribution. The approximation is fairly 
goodwhenn > 100,/> «£ 0 05. Figures 15.9 and 15.10 show the binomial 
and Poisson distributions when p, = 4, p = 0-4, and p = 0-04 respec¬ 
tively. 



Fig. 15.9 Binomial (-•) and Poisson (.■) distributions for fi = 4, p = 0-4 



Fig. 15.10 Binomial (-) and Poisson (-) distributions for fi = 4,p = 0 04 

Example (i) Suppose that example (i) of section 15.7.1 is evaluated 
using the Poisson distribution instead of the binomial distribution. 
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Solution Here n = 100, p = 0 01, p = 1, and 
f(r) = e-'V/H 
P{r = 3} =/(3) 

= e -1 (l) 3 /3! 

= 0061 3; 

P{r ^ 3} = 1 -/(0) -/(l) —/(2) 

= 1 - e _1 (l + 1 + i) 

= 1 - 0-919 7 
^0-080 3. 

Example (ii) Boxes contain 200 articles known to be on average 1 per 
cent defective. The manufacturer guarantees that in a box of 200 there 
will be fewer than three defectives. What is the probability that the 
guarantee will be violated? 

Solution The number of defectives in a box of 200 has a distribution 
which approximates to a Poisson distribution since n = 200 is large, 
p — 0-01 is small and np = p = 2. 

Let r be the number of defectives, then 
f(r) = e- 2 2 T /r\ 

Hence, approximately 

P{r < 3} = e- 2 (l + 2 + 2 2 /2!) 

= 5e- 2 
= 0-676 7. 

Exactly, using the binomial distribution and a longer calculation, 

P{r < 3} = J (0-01) r (0-99) 2OO-r 
= 0-679 78. 

Hence the probability that the number of defectives in a box is three 
or more is 0-323 3, and the probability the guarantee is violated is 
0-323 approximately, or 32-3 per cent. 

15.8.3 The Poisson distribution as an independent distribution 

If events occur at random at a constant rate in time or space, it can be 
shown that the number of events in a given time interval or in a given 
region has a Poisson distribution. The assumption is made that the 
number of events occurring in say, a given time interval is independent 
of the number that occurred in earlier intervals. Consider an example 


Then (a) 

and (b) 
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of an observed distribution that can be thought of as having a Poisson 
distribution. Table 15.5 gives the observed distribution of a-particles 
emitted in 6 seconds. The variable x is the number of a-particles 
emitted per 6-second interval and x was observed in 2 500 such 
intervals. The mean x for the observed distribution is 3-76. If this is 
taken as an estimate of the mean of a Poisson distribution, (j, = 3-76, 
then the corresponding frequencies would be expected to be 

2 500 e- 3 76 (3-76 ) r /r\, r = 0, 1, 2,. . .. 10. 

The results of computing these values are given in Table 15.5. 

a 0123456789 >10 

observed 

frequency 71 210 384 515 501 383 231 121 46 23 15 

expected 

frequency 58-2 218-8 411-4 515-7 484-7 364-5 228-4 122-7 56-7 24-1 13-8 

Table 15.5 

There is good agreement here. Had it not been so, the Poisson 
distribution as the mathematical model for this observed distribution 
would not have been acceptable. Other examples where the Poisson 
distribution has been found to be a good mathematical model are (i) the 
number of calls arriving at a telephone exchange in a given interval of 
time, (ii) the number of cars passing a given point of a busy road in a 
given interval of time, (iii) the number of blood cells in a given area say 
1 cm 2 of a haemocytometer, (iv) bacteria counts, (v) stars in space, and 
in many more similar cases. 

15.9 THE NORMAL OR GAUSSIAN DISTRIBUTION 

The binomial and Poisson distributions are both examples of theoretical 
distributions of a discrete random variable. The normal distribution is 
a continuous distribution whose characteristics are often found in 
practice and hence it serves as a mathematical model for these distribu¬ 
tions. It has the frequency function defined by 

(47) 

where fi and cr 2 are constants. 

The characteristics of the graphs of y — f(x) defined by equation (47) 
are that all are bell-shaped and symmetrical and die out quickly at the 
tails. A typical normal distribution is shown in Fig. 15.11. 

It can be shown that 

jj(x)dx=l for all values of fi and a 


(48) 
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15.9.1 Mean and variance of the normal distribution 

It can be shown that 

mean = J xf(x) dx = fi, (49) 

and variance = |* (x — /x) 2 f(x) dx = a 2 . (50) 

The integrations (48), (49), and (50) are beyond the scope of this 
book. 

15.9.2 The standardized variate 

Usually in problems the values of n and a are known or can be found. 
Probabilities, corresponding to the areas under the curve y = f(x) 
between given limits are required, and these are tabulated or, in some 
cases calculated, for ju = 0, a 2 = 1. If the random variable X has a 
normal distribution with fi = 0 and a 2 = 1, then X is said to have a 
standardized normal distribution. If x has a normal distribution with 
mean fx and variance a 2 , this will be written x — N(/u, a 2 ). Changing 
the origin and scale so that the new mean is zero and the new variance 1, 
i.e. writing y = x — /x gives 

mean of y = mean of x — fx = /x — /x = 0, 
and writing X = y/a = (x — fi)/a gives 

var ( X ) = var ( y)/o z = var (x)/o 2 = a 2 1 a 2 — 1, 
i-e. X=(x- ft) I a = N(0, 1). (51) 

The graph of y = f(X) = [exp (- |X 2 )]/V(2 tt) is sketched in Fig. 
15.12. It is an even function since f(—X)—f(X) and hence is sym¬ 
metrical about the j-axis. 


I 
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The distribution function F(X) is defined by 

F(X) = dt = J ^ exp (-^ 2 ) dt (52) 

(see equation (23) of section 15.3.2). This gives the area under the curve 
y =f(X) from -oo to I and represents the probability that X is less 
than some fixed value. 

Also, from equation (22) of section 15.3.2, 

j*Vw dX = P{a<X< b}. 



It is possible to evaluate the integral on the left between finite limits, 
using the numerical methods of Chapter 14. In particular, using 
Simpson’s rule of section 14.8.3 with h = 1, 

-Jrr T exp (-£X 2 ) dX = 0-497 5 ^ 0-5. (53) 

V( 2w ) Jo 

It is not necessary to evaluate the integral each time it is required, since 
tables of the distribution function F(X) are published, for example, in 
Cambridge Elementary Statistical Tables by D. V. Lindley and J. C. P. 
Miller. (It is assumed that the reader has a copy of these, or similar, 
tables.) The function F(X) defined by equation (52) is tabulated for 
values of X from X = 0 to X = 4-0. From the tables, when X = 0, the 
value of F(X) is 0-5, and this is understandable, since it represents the 
area from the left tail to the centre, or half the area under the curve in 
Fig. 15.12. Compare this with the result (53) which gives the area under 
the curve from X — 0 to X = 4, which is approximately half the total 
area. 
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As X increases towards the value 4, F(X) increases towards the value 
1, or the total area under the curve. When X = 0-67, then 
F(X ) = 0-748 6 =£= 0-75 using tables. Hence, between X = 0 and 
X = 0-67 there is approximately (0-75 — 0-5) = 0-25 of the total area; 
or, because of symmetry, approximately half the area lies between 
X — ±0-67. The area under the curve between X = ±1-96 i s 
2(0-975 — 0-5) = 0-95, or about 95 per cent of the total area lies 
between X = ±1-96. 

Example Let x be 7V(1, 4). Find the probability that x is less than 3. 

Solution Given x = JV(1, 4), write X = (x — l)/v'4 then X — N{ 0, 1) 
and 

P{x < 3} = P{X < K3 - 1)} = P{X < 1} = 0-841 
using tables to find the final answer. 


15.9.3 The normal distribution as an approximation to the binomial 
distribution 

When n is large and p is small the Poisson distribution gives a good 
approximation to the binomial distribution. When n is large and p is 
not necessarily small, the normal distribution is a good approximation. 
To show the degree of approximation consider the following two 
numerical examples. 


Example (i) A given discrete variable has a distribution with mean 12 
and variance 4. What is the probability that it is 14 or more ? 


Solution Here np = 12, npq = 4 so that q = \, p = \, n = 18, and 
the binomial distribution is 




The values for r = 0, 1, 2, 3,. . ., 18 are given in Table 15.6, and the 
histogram of these values is drawn in Fig. 15.13. The computation of 
values of / is made slightly easier by computing each term after the 
first from the previous term, so that 


m = (3) 18 = t 0 , 

/(1) = 18 x 2 /„ = t» 


and /(/•) = 


18 . 17 . 16 . . . .(18-r+l) /2W1 
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This method was used to find the values of/in Table 15.6, the values 
then written correct to four places of decimals. 


r 

4 

5 

6 

7 

8 

9 

10 11 

m 

0-0001 

0 000 7 

0-003 1 

0-010 5 

0-028 9 

0-064 3 

0-115 7 0-168 2 


12 

13 

14 

15 

16 

17 

18 

m 

0-196 3 

0-181 2 

0-129 4 

0-069 0 

0-025 9 

0-006 1 

0-000 7 


Table 15.6 

The required probability is given by 


P{r > 14} = 2 f(r) = 0-231 1, 

and this is represented by the area of the histogram in Fig. 15.13 to the 
right of AB. 



The normal approximation to the area of the rectangle ABCD is the 
area under the corresponding normal curve, or EBCF, i.e. the area 
under the normal curve from 13| to 14J. To evaluate the required 
probability using the normal approximation to this binomial distribu¬ 
tion it is necessary to evaluate the area to the right of 13|. The continuity 
correction of must always be allowed for. In this example, first 
standardizing the variate and then using tables, 


V4 


= Kx- 12), 
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and the required probability, 

P{x > 14} = P{X > K13| - 12)} 

= P{X > 0-75} 

= 1 - 0-773 4 
= 0-226 6. 

The approximation is good for such a small value of n. 

Example (ii) A random sample of 100 screws is taken from a batch 
which is known to be 3 per cent defective. Calculate the probability that 
the sample will contain (a) more than 5 defectives, (b) between 2 and 5 
defectives. 

Solution Here n— 100, p = 0-03, q = 0-97, so that u = np = 3 and 
tr 2 = npq = 2-91. 

Let x be the number of defectives per sample of 100, then x is N{ 3, 2-91). 
Let X be the standardized variate, so that 

X={x- 3)/V(2-91) = (x - 3)/1-706, 
and hence X is JV(0, 1). Then, 

(a) the probability that a sample contains more than 5 defectives is 
given by 

P{x > 5} = P{X > (4| - 3)/1-706} 

= P{X > 0-879 4} 

= 1 — P{X < 0-879 4} 

= 0-189 6; 

(b) the probability that a sample contains between 2 and 5 defectives 
is given by 

P{2 < x < 5} = P{l-5 - 3)/l-706 < X < (5-5 - 3)/l-706> 

= P{—0-879 4 < X < 1-466} 

= 0-928 7 - (1 - 0-810 4) 

= 0-739 1. 

It can be proved that if x represents the number of successes in n 
independent trials of an event where p is the probability of success in 
a single trial, then x -> N(np, npq ) as « increases. 

It is sometimes more convenient to work with the proportion of 
successes, x/n in n trials where p is the probability of success in a single 
trial. Then 


x/n-*N(p,pq/ri) as n increases. 
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15.9.4 Summary 

In any given problem (i) if n is small the binomial distribution, (37) 
of section 15.7 is used, (ii) if n is large and p is small the Poisson 
distribution, (44) of section 15.8 is used, and (iii) if n is large and p 
is either small or large the normal approximation may be used. 
The normal distribution is the easiest to use since this is tabulated for 
the standardized variate. The tables used in this chapter tabulate the 
function 


1 

V(2*0 


11 


e"** 1 dt. 


which is the probability that the standardized variate is less than X. 
Other tables give values of 


V(2' 




which is the probability that the standardized variate lies between 0 
and X. 


15.10 SOME USEFUL THEOREMS 

Let x and y be two independent random variables, i.e. the probability 
that * takes any set of values is unaffected by y and the probability 
that y takes a set of values is unaffected by x. Then, 

(i) if each value of x is multiplied by a constant c the mean of the 
resulting variable is c times the mean of x, i.e. 

(ex) = cx, 

(ii) the mean of the sum or difference of the variates x and y is the 
sum or difference of their means, i.e., 

fr±y) = x±y, 

(iii) the variance of the sum or difference of the variates is the sum of 
their variances, i.e. 

a\x ± y) = a\x) + o 2 (y), 

(iv) if each of x and y has a normal distribution then (x + y) has a 
normal distribution. 

The proofs of these theorems will not be given here and may be found 
in textbooks of statistics. 

EXERCISE 15 

1 A box contains 4 white and 2 red marbles. One marble is drawn 
from the box. What is the probability that the marble selected is 
(i) white, (ii) red? 
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2 What is the probability of obtaining the score (i) 6, (ii) 4 on rolling 
one die? What is the probability of obtaining a total score (iii) 12, 
(iv) 7 on rolling 2 dice? 

3 Find the probability that in 4 hands dealt from an ordinary pack 
of 52 playing cards each hand contains one ace. 

4 Find the probability that at least one head is thrown when 10 coins 
are tossed. 

5 A single die is rolled until a six is obtained. What is the probability 
that a six will be obtained (i) on the third attempt, (ii) on the «th 
attempt? 

6 Two different digits are chosen at random from the set 1,2, 3,. . .,9. 
Find the probability that (i) the sum of the digits is exactly 6, (ii) 
the sum of the digits is greater than 14, (iii) both digits are greater 
than 6. 

7 The probability that one particular component of a machine needs 
replacing in a 5-year period is £ and that of another is i. Assuming 
that these two events are independent, find the probability (i) that 
either one or the other needs replacing, (ii) that both components 
need replacing, (iii) that neither component needs replacing, in the 
period. 

8 A and 5 take turns in throwing a pair of dice. A starts and the first 
to throw a total score of 9 wins. What are the respective probabili¬ 
ties? 

9 A single die is rolled. What is the probability that the score is 3 if 
it is known that the score is odd ? 

10 A box contains 3 black and 2 red balls marked SI, 52, 53 and 51, 
52. Two balls are drawn in turn from the box without replacement. 
Find the probability that (i) both balls are red, (ii) both balls are 
of the same colour, (iii) a total score of 4 is obtained, (iv) the total 
score is 4 if it is known that both balls are black, (v) the first ball 
drawn was black if it is known that the second one drawn is 51. 

11 A random variable x has the frequency function fix) = cxt~ z , 
x ^0. Determine (i) the value of c, (ii) the distribution function 
F(x), (iii) P{x < 1}, (iv) 5{1 < x < 2}, (v) the mean and variance 
of x. 

12 A random variable has the frequency function fix) defined by 
f(x) = 0, x < 0; /(x) = cx, 0<x< 1; f(x) = c+l- x, 
1 < x < 2; f(x) = 0, x > 2. (i) Evaluate c, (ii) find the distribution 
function F(x) and sketch the graphs of fix) and Fix). 

13 Use coding to find the mean and standard deviation of the frequency 
distribution given in the following table. 

x I 2-30 2-35 2-40 2-45 2-50 2-55 2-60 2-65 2-70 2-75 
/ I 1 5 6 13 8 17 14 7 1 3 
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14 Examination marks are scaled either (i) by subtracting 10 from 
each mark or (ii) by subtracting 10 per cent of each mark. How do 
these forms of scaling affect the mean and standard deviation of 
the set of examination marks? 

15 Sketch the histograms of the binomial distributions in which p = l 
and (i) n = 5, (ii) n = 10, (iii) n = 25. 

16 Experience shows that 25 per cent of a new seed germinates. If 
1 024 packets each containing 5 seeds are tested, find the number of 
packets in which it is expected that 0, 1, 2, . . .,5 seeds germinate. 
Sketch the histogram. 

17 A firm makes a certain kind of switch and produces 0-5 per cent 
defective. They are packed in boxes of 100. What percentage of 
boxes (i) has no defective switches, (ii) has 2 or more defective 
switches ? Use (a) the binomial and (b) the Poisson distributions. 

18 Fit a Poisson distribution with the same mean to the following data: 

Number of faults I 0 I 2 3 4 5 

Frequency of faults | 18 28 25 17 9 3. 

19 The number of telephone calls received by an operator between 
8.30 and 8.35 a.m. fits a Poisson distribution with mean 3. Find the 
probability that (i) in any given day there will be no calls in that 
interval of time, (ii) in 3 given consecutive days there will be a total 
of 2 calls in that interval of time. 

20 A manufacturer knows that 15 per cent of his product is defective. 
It is inspected in batches of 60 units. Find the mean and variance 
of the number of defectives in batches of 60 and use tables of the 
normal distribution to calculate the approximate percentage of such 
batches which contain more than 10 defectives. 

21 On the same axes sketch the normal curves of / against x for one 
hundred trials of an experiment when x is (i) N( 0, 1), (ii) N( 0, 4), 
(iii) N( 0, 100). What percentage of the total area in each case lies 
between ±0-67cr, ±<r, ±l-96<r? 

22 The resistances of 200 similar resistors of nominal value 1 000 Q 
were measured to the nearest ohm and their values classified in the 
following table. 


Resistance x 1 

995 

996 

997 

998 

999 

1 000 

1 001 1 002 

Frequency / 

5 

0 

13 

12 

12 

22 

13 17 

Resistance x 

1 003 

1004 

1 005 

1006 

1007 

1 008 

1009 

Frequency / 

19 

20 

16 

18 

15 

12 

6 


Find the mean resistance and the standard deviation. Assuming a 
normal distribution with this mean and standard deviation find the 
probability that any one resistor falls outside the manufacturer’s 
guarantee that the values of the resistances are within ± 1 per cent 
of their nominal value. 
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Statistics II 


16.0 INTRODUCTION 

Broadly speaking, the aim of most statistical enquiries is to infer 
properties of a population from a sample from that population. The 
population corresponds to the frequency function, the sample corre¬ 
sponds to an observed frequency distribution. If the number of observa¬ 
tions is large, relative frequencies of the observed frequency distribution 
tend to probabilities of the frequency function. 

Thus, given an observed frequency distribution, the problem is to 
select a theoretical (mathematical) model, to check that this model is 
reasonable, and finally to draw conclusions about the population. 

There are three types of statistical problem to consider. The first is 
the estimation of parameters from observed frequency distributions, 
for example, the estimation of the value of the mean or variance of a 
population from the data from a sample. The second is the testing of 
hypotheses, which leads to significance testing. Finally, one test that 
the model chosen is a reasonable model will be given. 

16.1 METHODS OF SAMPLING 

A random sample is a sample from a population in which each individual 
member of the population has the same chance of being selected. 
Simple sampling is random sampling where the probability of selecting 
any individual remains constant throughout the sampling. If the popu¬ 
lation is very large or can be thought of as infinite, then every random 
sample is simple. If the population is finite, simple samples are obtained 
only if sampling is with replacement, i.e. after the selection of an 
individual it is replaced in the population. From now on in this chapter 
all samples are simple. 

16.1.1 Estimation 

In most cases a good estimate of an unknown population parameter is 
obtained by computing the corresponding quantity for the observed 
frequency distribution, so that, to estimate the population mean p the 
mean x (or m) of the observed distribution is used, to estimate the 
population variance o 2 the variance s 2 of the observed distribution is 
used. That is, 

estimate p by x = (Sn ( x: ( )/n, 

estimate a 2 by s 2 = [Xn^Xj — x ) 2 ]/n, 

where = n — total number of observations. 
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16.1.2 Definition 

Any quantity computed from the sample and used to estimate an 
unknown population parameter is called a statistic or estimator. 

16.2 SAMPLING DISTRIBUTION OF THE MEAN 

Suppose a sample of n observations x x , x 2 , x 3 , . . x n is taken from a 
population, then the sample mean is, say, x x = (Zx,)/«. Suppose a 
second sample is taken from the same population, then this will have 
a sample mean x 2 which may be different from x ± . A third sample 
under the same conditions will have mean x 3 , and so on. That is, the 
value of x varies with the sample, and the sample means may be 
tabulated as in Table 16.1. 

Sample number I 1 2 3 ... k 

Sample mean | x t x 2 x 3 ... x k 

Table 16.1 

Thus x is not constant over all samples but has a distribution, and the 
purpose is to find the frequency function for x. This is called the 
sampling distribution of the mean. 

Suppose the population has mean p and variance a 2 , and a random 

sample x u x 2 , x 3 . x n is taken from this population. The sample 

mean x = (Sx,)/n. Since the sample is random, x lt x 2 , x 3 , . . ., x n can 
be interpreted as n independent random variables. Also the mean of 
x f = p and the variance of x t = a 2 for all x t , since each is drawn from 
the same population. Then, 

(mean of x) = (mean of x x + mean of x 2 + . . . + mean of x n )/n 
= (p, + p + . . . + p)/n 
= np/n = p 

or, mean of x = population mean p; 

(variance of x) = var (x x + x 2 + x 3 + . . . + x n )/n a 

= [var (x x ) 4- var (x 2 ) + var (x 3 ) + . . . + var (x n )]/n 2 
since x x , x 2 , x 3 , . . ., x n are independent. 

Hence var (x) = (a 2 + a 2 + <r 2 + . . . + a 2 )/n 2 
= no 2 /n 2 
= n 2 /n. 

Hence mean (x) = p, var (Jc) = a 2 Jn. (1) 

If the population has a normal distribution, nx = x x + x 2 + x 3 
+ . . . + x n is also normal, i.e. 

nx = N(np, no 2 ), 
x = N(p, o 2 /ri). 


or 


( 2 ) 
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and the sampling distribution of the means is also normal. The standard 
deviation, which is the square root of the variance, of a sampling 
distribution is called the standard error , (S.E.), so that S.E. (x) = a/y/(n). 
The distributions of x = N(ji, 1) and of x = N(/t, i) are sketched in 
Fig. 16.1. 



If two independent samples from two different populations are being 
considered, then 

S.E. (x±y) = V{var (x ± y)} 

= \/{var (x) + var (j?)} 



where o\ is the variance of the population of x’s and the sample size, 
a\ the variance of the population of _y’s and n 2 the sample size. 

16.2.1 Population parameters estimated from a sample 

It is necessary to estimate the population parameters p. and a 2 from a 
sample from the population. First consider an estimate of the popula¬ 
tion mean [i to be the sample mean x. From equation (1) of section 16.2 
the mean of x is n and the variance of x is a 2 /n, so that the distribution 
of sample means is ‘narrower’ than the distribution of the population. 
This is shown in Fig. 16.1 with a = 1, n = 4. Hence x has a high 
probability of being near to / 1 . Moreover as the sample size increases, 
x approaches the value fi. Hence the mean of the population is estimated 
as the mean of a sample. That is 


fi = X 


(4) 
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The sample variance s 2 can be shown to be the fraction (n — 1)/« 
of the population variance, that is, it is too small. Hence for small 
samples the variance ns 2 /(n — 1) is used to estimate a 2 , and since 


ns 2 1 

n — 1 — n — 1 


c*i - xf. 


the estimate of a 2 is evaluated by dividing the sums of squares by 
(n — 1) instead of n. For large values of n the factor «/(« — 1) is not 
important. Hence the variance of the population a 2 is estimated from 
the sample as 

a 2 = n ” l s 2 = j- 2 (*i ~ x) 2 , n small, (5) 

a 2 = s 2 , n large. (6) 


16.3 TESTING HYPOTHESES AND CONFIDENCE 
INTERVALS 

A fairly general definition of a statistical hypothesis is that it is an 
assumption about the frequency function of a random variable. A test 
of a statistical hypothesis is a test to decide whether to accept or reject 
the hypothesis. In problems considered in this section, the object is to 
test an assumed statement, called a null hypothesis (N.H.) to see if it 
is acceptable. 


16.3.1 Significance test based on the normal distribution 

This test depends on knowing that the population being tested is 
normally distributed, on knowing the variance, and on having a reputed 
mean of the distribution. 

Suppose a sample of n individuals from a given population has mean 
Jc, the general procedure for testing this mean is as follows: 

(a) make the null hypothesis that the true mean is p, 

(b) note the difference between the sample mean Jc and p, 

(c) find the theoretical probability that in random sampling the differ¬ 
ence between Jc and p is greater than the observed difference, 

(d) if this probability is sufficiently small, reject the null hypothesis. 
That is, conclude from the given data that this sample is not from the 
given population whose mean is p. 

An example to demonstrate the procedure follows. 

Example A firm is producing packets whose weights are known to be 
normally distributed with variance 0-25 gramme 2 , and the mean weight 
is supposed to be 10 g. A random sample of 10 packets is taken and 
the sample mean is found to be 9-64 g. Is the firm giving correct weight? 
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Solution Let the weight in grammes be x. The sample mean x is 9-64, 
the number of items in the sample n is 10. Following the procedure for 
testing this value of x: 

(a) the N.H. is that the population mean fi is 10; 

(b) the sample mean x differs from n by 0-36; 

(c) the probability that in random sampling x would be such that 
|Jc — n\ > 0-36, i.e. the probability that \x — [i\ > 0-36 by chance is 
found by standardizing the variate and using tables of the normal 
distribution as follows. Let 

X=(x- M)/(o/Vn) 

= -0-36 V10/0-5 
= -2-28, 

then X = N(0, 1), and from tables, 

P{\X\7* 2-28} = 2 P{X > 2-28} 

= 2[1 -P{X< 2-28}] 

= 2[1 - 0-988 7] = 0-022 6 
< 0-05 or 5 per cent. 

(d) The critical probability level is usually taken at 5 per cent, and the 
test shows that x differs significantly from // at the 5 per cent level, and 
the null hypothesis is rejected. Hence it is not by chance that the 
incorrect weight is being given. 

Values of the standardized variate X and the corresponding proba¬ 
bilities are given in Table 16.2. 

P I 20% 10% 5% 2% 1% 0-1% 

|2f| > | 1-282 1-645 1-960 2-326 2-576 3-291 

Table 16.2 

The general procedure for testing means in normal populations is 
now re-written as follows: 

(i) make the null hypothesis that the true mean is / 1 , 

(ii) compute X = (x — where a 2 is the variance and n the 

number of observations in the sample, then X = N( 0, 1), 

(iii) if |Z| > 1-96, then x differs significantly from p at the 5 per cent 
level and the N.H. is usually rejected, 

(iv) if lA'I > 2-576, x differs significantly from fi at the 1 per cent level 
and the N.H. is always rejected, 

(v) if jV| < 1-96 there is no reason to reject the N.H. 

It is not possible to prove that the null hypothesis is true. If | A"| < 1-96 
the sample values are consistent with the null hypothesis, or there is no 
evidence from the given sample to reject the null hypothesis. 
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The above test is a two-tailed test in which the probability that \X\ 
is greater than a fixed number is given. A one-sided test in the above 
example would be say, a test for short weight and 
P{X < -2-28} = P{X> 2-28} 

= 1 - 0-988 7 
= 0-011 3 
< 5 per cent level. 


16.3.2 Confidence intervals (interval estimation) 

Assume that the population mean p is unknown and is estimated by 
the sample mean x. Then the 95 per cent confidence interval for p is that 
interval such that 

P{p — l-96or/V'« < x < (i + 1-96<t/v«} = 0-95 
i.e. P{x — 1-96(7 l<s/n < p < x + 1-96<r/\/«} = 0-95 

since if x is within ± l-96cr/v , « of ft, then ft is within ± 1 •96cr/v / « of x. 
In the above example x = 9-64 and the interval becomes 
9-64 - 1-96.0-5/V10, 9-64 + 1-96.0-5/VT0 
which is 9-33 to 9-95. 

A confidence interval for any parameter may be thought of as the 
values between which that parameter must lie for a test of significance 
to be non-significant. In terms of the mean, with standard error a/\/n, 
as n increases, of^/n decreases so that the width of the confidence 
interval decreases and accuracy increases. 


Example (i) The values of resistors produced by a given machine appear 
to have a normal distribution with mean 29 Q and variance 1-44 Q 2 . 
The process is modified in such a way that the variance is not affected 
but the mean may be. A random sample of 10 resistors is taken and 
the values in Q are 


30-31 31-14 30-14 30-22 27-87 

28-37 31-37 29-70 29-07 29-41 

Has the mean changed ? 

Solution The N.H. is that there is no difference in the mean, or that 
the mean p is 29. The sample mean x is 

29 + (1-31 + 2-14 + . . . + 0-41)/10, 


or x is 29-76. 

The standardized variate X is given by 


m = 


129-76 - 29[ 
<t/V« 


0-76-y/lQ 


= 2-003. 


1-2 
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This is greater than 1*96 and hence the N.H. is rejected. That is, x 
differs significantly from p at the 5 per cent level and modification has 
tended to increase the mean. 

It is of interest to note that the 95 per cent confidence limits for the 
new mean are x — 1-96a/\/«> x + 1-9 6a/-\/n which are 29 02 and 
30-50. The interval between these limits does not contain the value 29, 
of the old mean, and this was to be expected from the result found. 

Example (ii) The classic blindfold drinking test is based on the binomial 
distribution and the technique of testing is the same. A person claims 
that he is able to distinguish between two drinks A and B. In an experi¬ 
ment he is correct 5 times in 6 trials. Does this result justify his claim ? 

Solution The N.H. is that the person guesses, i.e. that he is equally 
likely to choose A or B, i.e. that the number of correct guesses has a 
binomial distribution with p = £, q — Let r be the number of 
successes, then from the binomial distribution, with n = 6, 



The mean number of successes is given by np which is 3, and the 
observed number of successes is 5. Hence the difference is 2. The 
probability that this difference is greater than or equal to 2 is given by 

P{\r - np\ > 2} =/(0) +/(1) +/(5) +/(6) 

- (Ml + 6 + 6+1} 

= 14/64 
= 0 - 22 . 

This probability is greater than 5 per cent, and hence there is no reason 
to disbelieve the null hypothesis that the person guesses. 

16.4 LARGE SAMPLE THEORY 

If it is known that a population is normally distributed, then the 
distribution of the means of samples from the population is also 
normally distributed. If the population distribution is N(p, o 2 ), then 
the sampling distribution of the mean is N(p, o 2 /n). 

If the population is not normal, it is still true that the mean of the 
distribution of the means x is p and its variance is a 2 In, but x is not 
normally distributed. It can be shown that the distribution of the means 
x tends to normality as n increases. This is known as the Central limit 
theorem which may be stated as follows. 
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Let x u x 2 , x 3 , . . ., x n be n independent random variables from the 
same distribution (not necessarily normal), with mean (x t ) = p, 

variance (x,) = a 2 , for all values of i. If x = ^ jn, then the 

theorem states that, as n -► oo, (x — p)j{aj^/n) N(0, 1). 

The distribution of most estimators computed from a sample of size 
n approach normality as n increases. 

16.4.1 General procedure for large sample testing 

A large sample is one which contains more than about fifty observa¬ 
tions, and the procedure for testing a parameter 6 is as follows: 

(i) make the null hypothesis that the value of the parameter is 0, 

(ii) compute an estimate of 0 from a sample, say d 0 , 

(iii) evaluate the standard error of 0 O , say s 0 , 

(iv) compute the standardized variate X given by 

1*1 = \o 0 - e\is 0 . 

Then, 

if |*[ >2, corresponding approximately to 5 per cent significance, the 
N.H. is usually rejected, 

if |*| ^ 3, corresponding approximately to 1 per cent significance, the 
N.H. is always rejected, 

if |Aj < 2, there is no evidence from this sample to reject the N.H. 

16.4.2 Testing the mean of a large sample 

Let a sample of n (large) observations have values x u x 2 , x 3 . x n , 

and let the problem be to test whether the population from which the 
sample is drawn has mean [i. The population is not necessarily normal, 
and the variance is unknown. Using the procedure of section 16.4.1, 6 
now being the mean, 

(i) the N.H. is that the population mean is //, 

(ii) the sample mean x is (Sx f )//i, 

(iii) the standard error of the mean is a/y/n and a is unknown. Hence 
a 2 is estimated from the sample by s 2 the sample variance, where 

a* = [2(x, - x) 2 ]/n 
= [£x? - x 2 ]/n. 

(iv) The standardized variate X given by \X\ = \x — p\/(s/y/n) is now 
tested. 

Example A sample of 900 observations is found to have a mean 3-5 
and standard deviation 2-62. Can it be regarded as a random sample 
from a population whose mean is 3-34 ? 
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Solution The N.H. is that the mean [i = 3-34. The mean of the sample 
x = 3-5 and the estimated standard deviation s = 2-62. Hence the 
standardized variate X is given by 


m = 


|3-5 - 3-34| 
2-62/V(900) 


_ 4-8 
"2-62 


and the result is obviously less than 2. Hence the result is not significant 
and there is no reason to suppose that this sample was not drawn from 
the population whose mean is 3-34. 

It is of interest to note that in this example, the 95 per cent confidence 
limits for the mean are x — 1-96 s/y/n, x + 1-9 6s/\/n, which equal 
3*33, 3-67 (or x — 2s/\/n, x + 2sj\/n, which equal 3-33, 3-67), and the 
interval between these limits includes the mean of the population. 


16.4.3 Testing the difference between two means 

Two problems are considered: 

(a) given two independent samples of n x and n 2 observations, they are 
tested as to whether they come from the same population, 

(b) given two samples from different populations, to test whether the 
means of the populations are the same. 

The procedure is as follows: 

(a) the N.H. is that both samples are drawn from the same population 
whose mean is fi and whose variance is or 2 . 

Let x± be the mean of the sample size n x and x 2 be the mean of the 
sample size n 2 . Then var (xj) = o 2 ln u var (x 2 ) = o 2 jn 2 and using 
section 15.10 theorem (iii), 

var (x x - x 2 ) = o 2 ln x + o 2 /n 2 

so that S.E .(x x -x 2 ) = oJ{^ + ^j. 

If a 2 is unknown a pooled estimate s 2 is used where 

.r 2 = ? P(x t - x,) 2 + S(x 2 - x 2 ) 2 ] 

n x + n 2 

_ n x s\ + n 2 s 2 
~ n x + n 2 
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Also, since the mean of (x a — x 2 ) is zero, using section 15.10 theorem 
(ii), the standardized variate X is given by 

jfe - jg) - 01 


and X may be tested as usual. 

(b) The N.H. is that the means of the two populations are each equal 

to fl. 

Let the sample means be x x and x 2 of samples size and n 2 respectively 
from populations whose variances are a* and a\. Then, 

var (x x — x 2 ) = var (x x ) + var (Jc 2 ) 

_ a\ <r| 

~ «X + «2 

and S.E . Wl -« = yg + |), 

mean (x x — x 2 ) = n — (i = 0, 

and finally 

Y l(*i ~ x 2 ) - 0[ 

' 

which may be tested as usual. 

If a\ and a\ are unknown, they may be estimated from the samples by 
s\ = - jc x ) a 

and si = - x 2 y. 

These tests are exact if (i) the populations are known to be normal 
and (ii) the variances are known. If (i) holds and (ii) does not, it is 
better to use the /-test which is an exact test valid for samples of any 
size and is discussed in section 16.6.1. 

Example (i) The means of two random samples of 100 and 200 observa¬ 
tions are found to be 66-5 and 67-1 respectively. Can these samples be 
regarded as drawn from the same population whose variance is 6-25? 
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Solution The N.H. is that both samples are drawn from the same 
population, or, that there is no difference in their means. Then, mean 
(*! — x 2 ) = 0, <r 2 = 6-25, so that a = 2-5, and 


Finally, 


S.E. x 2 ) + 20Q j 

0-75 
" V6‘ 

y jfe ~ ~ 0| 

A " S.E. 

0-6V6 
“ 0-75 


<2 


and there is no reason to reject the null hypothesis, and both samples 
could be regarded as having been drawn from the same population. 


Example (ii) A sample of 7 000 observations has a mean 46-9 with 
standard deviation 1*95 and a sample of 2 000 observations has a mean 
45-9 with standard deviation 2-02. Is the difference in their means 
significant? 


Solution The N.H. is that the samples are drawn from different popula¬ 
tions with equal means, or, there is no difference in the two means. 
Estimate a\ by sf = (1-95) 2 and a\ by = (2-02) 2 , then 


s.e -y(^+ 


2 - 02 3 \ 
2 000 / 


^0-050 8, 


|*i — x 2 \ = 1, and 


X = 


-> 19 

0-050 8 


Hence the difference in the means is highly significant. The null hypo¬ 
thesis is therefore rejected. There is a difference in means. 


16.5 SAMPLING ATTRIBUTES 

Sometimes it is required to find the probability that an individual has 
a certain characteristic. For example it may be required to find the 
probability that an individual is a cigarette smoker, given the proportion 
of men in that age group who smoke; or the probability that a manu¬ 
factured part is defective knowing the proportion of defective parts 
produced by the machine making the parts. 
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The general method is to consider a sample of size n, and suppose 
that r out of n possess the attribute under consideration. Then r has a 
binomial distribution and 


P{r out of n have the attribute} 




The mean value of r is np, and to test the hypothesis that the probability 
of an individual having the attribute is p, r is compared to np, that is, 
the theoretical probability that \r — np\ is greater than the observed 
difference is evaluated. If r 0 is the observed value of r, then 
P{\r — np\ > \r 0 — np\) is computed. 


16.5.1 Large sample test of a value of p 

For a large sample the normal distribution may be taken as an approxi¬ 
mation to the binomial distribution, and r is approximately N(np, npq). 
Then the procedure is as follows: 

The N.H. is that the probability any individual has the attribute is p, 
then the mean (r) is np and the variance (r) is npq. Hence the S.E, 
(/*) = -\/(npq ), and the standardized variate X given by 

y l r ~ M 

~ V(npq) 

is tested in the usual way. 

Example (i) A six-sided die is thrown 9 000 times and a 5 or a 6 is 
obtained 3 210 times. Is the die biased? 


Solution The N.H. is that the die is unbiased, i.e. 

p = P{5 or 6} = h q=l-p=l 
The expected number of (5 or 6) is np, i.e. 3 000. Then 
|r — np\ = 210, 
and S.E. (r) = -\/(npq) 

= V2 000 
= 44-72, 


so that 


X 


210 

44-72 


This result is highly significant and the die is biased. 
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Alternatively, instead of working with r, it may be easier to work 
with the proportion -n = rjn, then as 

r N(np, npq), 

■7T N(p, pq/n), 

s.e. ( 77 ) = Vipq/n) 


and X= - /7 - ~ 

Vipql") 

-* N(0, 1) 

which may be tested as before. 

Example (ii) Using the data of the previous example and the proportion 
77 , find whether the die is biased. 

Solution Here 


On the N.H. that p = i = 0-333 3, 

\ir-p\ = 0023 4 
and S.E. (77) = V(pql n ) 

= 0-004 97. 


Hence 

as before. 


0-023 4 
X ~ 0 004 97 
= 4-7, 


16,5.2 Comparison of two large samples 

Suppose that there are two samples of n x and n 2 observations where 
«! and « 2 are large, and that in the first sample r ± out of n l have the 
attribute A, and in the second sample r 2 out of n 2 have this attribute. 
Then it is possible to test whether these two samples come from the 
same population, or whether the probability, p, that an individual has 
the attribute A is the same for the two populations from which the 
samples are drawn. For, on the null hypothesis that p is the same for 
both populations, let -rr 1 — r 1 /n l and ir 2 = r 2 /n 2 , then approximately, 

771 = N(p, pq/n,), 

772 = N(p, pq/n 2 ). 


Hence, 
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and the observed difference — ir 2 is compared with the theoretical 
difference zero, using 

v _ Kl ~ 7r a\ 

S.E. (TTi - tt 2 )‘ 

If p, q are unknown, a pooled estimate p of p is used, where 

0 = H±h 

P «i + n 2 

_ n-jTTy + n 2 TT 2 

~ n 1 +n 2 

and q = l — p. 

Example In a test involving the safe driving of nineteen-year old motor 
cycle owners driving approximately the same mileage, it was found that 
in one large city 600 out of a sample of 800 had no accident in one 
given year, while in another city in the same year, 700 out of a sample 
of 1 200 drivers had no accident. Do these results show that one city 
has a better safety record than the other for this class of driver? 

Solution The N.H. is that the proportion of safe drivers is the same for 
both cities. 

The relevant calculation is set out below: 


= 800, r x = 600, 77-j = 
n 2 = 1 200, r 2 = 700, 77*2 = 7/12, 
p = (600 + 700)/(800 + 1 200) = 13/20, 
q = 7/20, 

7Tj — 7T 2 = 1/6, 

S.E. (tt, - tt 2 ) = V(13 X 7)/(20 X 20 X 480) = 0 021 77 


and 


X = 


0-166 7 
0-021 77 


>7. 


The result is highly significant, and the null hypothesis is rejected. That 
is the proportion of safe drivers in the two cities is different. 


16.6 SMALL SAMPLE THEORY 

Tests of significance for a large sample are constructed by comparing 
the modulus of the difference between observed and expected values of 
a statistic with the standard error, since the distribution of most 
statistics tends to normality as the number of individuals in the sample 
increases. For small samples most distributions of statistics are far from 
normal, in particular estimates of parameters obtained from small 
samples are not reliable. For example, substituting the sample variance 
s 2 for the unknown population variance a 2 affects the distribution 
seriously. It is necessary to construct exact tests which are valid for 
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small samples (i.e. samples of between 5 and 50 individuals). A small 
sample test which is exact and which is valid for samples of all sizes is 
Student's t -test, first derived by W. S. Gossett (1908) who wrote under 
the name ‘Student’. 

16.6.1 The {-distribution 

Suppose x t , i = 1, 2, 3, . . ., n is a random sample of n observations 
from a population whose mean is fi and whose variance is a 2 . If the 
population is normal then the distribution of x is N(jx, o 2 jn). If a 2 is 
unknown it is estimated by 

i 2 = (S(x, - xfW - 1), (7) 

using equation (5) of section 16.2.1. Then (x — /x)/(s/\/n) no longer 
has a normal distribution, although its distribution tends to normality 
as n increases. The exact distribution of (x — fx)j(s/\/n) is known and 
is called the /-distribution with n — 1 degrees of freedom. The n variables 
in equation (7) are not independent since they are connected by the 
relation S(x ( — x) = 0, so that the number of independent variables is 
reduced from n to (n — 1). 

The /-distribution is tabulated in Table 3 of Cambridge Elementary 
Statistical Tables, Lindley and Miller, and a short extract from this is 
given in Table 16.3 below. 


Number of degrees 
of freedom 

I Significance levels 

5 % 

1% 

1 

12-71 

63-66 

5 

2-57 

4-03 

10 

2-23 

3-17 

15 

2-13 

2-95 

00 

1-96 

2-58 


Table 16.3 


It is usual to write 

'»-i = (x - fiWIVn). 

Note that, as n -*■ oo, t n -> 1V(0, 1). 

16.6.2 Testing a mean, variance unknown 

To test whether the sample of n observations x X) x z , x 3 , . . ., x n is 
drawn from a normal population of mean fi, the procedure is as follows. 
The N.H. is that the population mean is fi, the sample mean x is 
(Ex, )/n, the estimate of the variance s 2 is given by 
f 2 = (S(x t - x) 2 }/(« - 1) 
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and t n -x = (x — nWIVn) has a 1-distribution with n — 1 degrees of 
freedom. 

Let t u be the a per cent point of the /-distribution with n — 1 degrees 
of freedom, i.e. 

F{|/| > Q = oc/100. 

Then if the modulus of the observed value of t is greater than or equal 
to t m the mean x differs significantly from p at « per cent. The value of 
a is usually taken to be 5. 

Example A sample of ten observations from a normal population has 
values 


31-31 32-14 31-14 31-22 28-87 

29-37 32-37 30-70 30-41 30-07. 

Test whether this sample was drawn from a population whose mean 
is 30. 


Solution The N.H. is that the population mean p is 30. From the data, 
x — 30-76, n = 10 so that n — 1 which is the number of degrees of 
freedom, is 9. Also the estimate of variance s 2 is given by 


Hence 

and 


s 2 = [2(x - x) 2 ]/9 
= [Sx 2 - (Sx) 2 /10]/9 
- 1-251 3. 

S= 1-118 6, 
s/Vn = 1-118 6/V10 


= 0-353 7. 


Hence, t 0 = (x — p)/(§/V n ) 

= (0-76)/0-353 7 
= 2-149. 


From tables the 5 per cent point of t 9 is 2-26. The observed value of t g 
is 2-149 which is less than 2-26. Hence the result is not significant at 
the 5 per cent level and there is no reason to reject the null hypothesis. 
The a per cent confidence interval for p when p is unknown is 
x - tJ/Vn, x + tJ/Vn 

where t a is the a per cent point of t with n — 1 degrees of freedom. 
Thus in the above example the 95 per cent confidence interval for p is 

30- 76 - 2-26 X 0-353 7, 30-76 + 2-26 x 0-353 7 which equal 29-96, 

31- 56, i.e. the mean of the population lies between 29-96 and 31-56 at 
the 95 per cent level of significance. 
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16.6.3 Testing the difference between two means 

Given two independent samples of n u and n 2 observations, the t- 
distribution is used to test whether the two samples may be regarded 
as drawn from the same normal population, assumed to be N(p, a 2 ). 
Let Jc! and x 2 be the means of the samples, then 

Xj = N(ji, a 2 l ni ), 
x 2 = N(p, o 2 /n 2 ) 

and X! — x 2 = N(0, u 2 (l/n 1 + 1 /n 2 )). 

To estimate a 2 it is necessary to use a pooled estimate s 2 using data 
from both samples, where 

*2 _ [£(x t — *i) 2 4- L(x a — x 3 ) a ] 

5 “ («1 - 1 ) + (« 2 - 1 ) 



(«1 - + («2 - i>! 

(B) 


+ n 2 - 2 

Then 

S.E. (x x - x 2 ) = V(l/«i + l/n 2 ). 

(9) 


From the null hypothesis that both samples are drawn from the same 
population, 

S.E. (x x - x 2 ) 

has a /-distribution with «i + n 2 — 2 degrees of freedom and can be 
tested as before. 

Example The measurements of a sample of 10 components from one 
machine has mean 60 cm and an estimated variance of 49 cm 2 . A 
sample of 16 components from a different machine has a mean 50 cm 
and an estimate of variance 100 cm 2 . Test whether both samples are 
drawn from the same population. 

Solution The N.H. is that there is no difference in the means of the 
two samples. Then with n x = 10, x 1 = 60, s 2 = 49, n 2 = 16, Jc 2 = 50 
and s 2 = 100, the pooled estimate of variance is, using equation (8), 
s 2 = (9 x 49 + 15 X 100)/24 
= 80-875. 

Hence s = 8-994 and, using equation (9), 

S.E. (x x - x 2 ) = 8-994^/(1/10 + 1/16) 

= 3-625. 

Hence / = |x x — jc 2 |/S.E. (x x — x 2 ) 

= 10/3-625 
= 2-759 

with 24 degrees of freedom. 
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From tables the value of t for 24 degrees of freedom at the 5 per cent 
level of significance is 2-06 which is less than the observed value of 
2-759. Hence the result is significant, there is a significant difference in 
the means and the null hypothesis is rejected. 

16.6.4 Paired observations 

It may be possible to arrange a controlled experiment so that two 
samples each of the same number of observations are deliberately 
paired. 

Suppose x t and y t , i = 1, 2, 3, . . ., n are two sets of observations 
which can be paired and that their means are x and y respectively, and 
let z 4 = x t — y t , i = 1, 2, 3,. . ., n. The null hypothesis is that there is 
no difference in the populations from which the samples are taken. 
Hence if x = N(p, a 2 /n), y = N(ji, o 2 lri), then z = x — y = N( 0, a 2 ) 
where cr| is estimated by = [2(z — z) 2 ]/(« — 1). The statistic 
t = |z — 0\/(sJ\/n) has n — 1 degrees of freedom and may be tested 
as before. 

Example To compare the effect of different fertilizers A and B on the 
growth of tomatoes, a controlled experiment was performed using six 
different positions in a greenhouse and the yield in pounds weight was 
as follows: 


Position 

1 

2 

3 

4 

5 

6 

A(x) 

13-2 

11-1 

12-2 

12-6 

10-4 

14-0 

B(y) 

12-9 

11-4 

10-3 

10-8 

9-8 

13-5 


Test whether one fertilizer is significantly better than the other. 

Solution The N.H. is that there is no significant difference between A 
and B. Here 

z = x-y | 0-3 -0-3 1-9 1-8 0-6 0-5, 

and Szj = 4-8, z = 0-8, Szf = 7-64 and (Sz,) 2 = 23-04, so that 
s* = M7-64 - *(23-04)] = 0-76. 

Hence s z = 0-871 8, sjy/n = 0-355 8 and finally 
t = 2-252. (=0-8/0-355 8) 

From tables, t — 2-57 for 5 degrees of freedom so that the difference 
is almost significant at the 5 per cent level, and there is insufficient 
evidence from the data to show whether one fertilizer is better than the 
other. 
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16.7 THE CHI-SQUARE DISTRIBUTION 

Let X u X 2 , X 3 , . . X n each be normally distributed with mean zero 
and unit variance. Then U = X\ + X\ + X\ + . . . + Xl has a 
known distribution called the Chi-square distribution, written y 2 distri¬ 
bution, with n degrees of freedom. This distribution which is continuous 
is tabulated and Table 5 of Cambridge Elementary Statistical Tables 
gives the percentage points of the % 2 distribution. An extract from this 
table is given in section 16.7.3. 

16.7.1 Properties of the y 2 distribution 

(i) If U u U 2 , U 3 , . . U m have independent y 2 distributions with 
«i, n 2 , n 3 , . . ., n m degrees of freedom, then U 1 + lf 2 + U 3 + . . . 
+ U m has a y 2 distribution with n — n x + n 2 + « 3 + . . . + n m 
degrees of freedom. 

(ii) If the X t are not independent but satisfy k linear equations of the 
form 

®u.Ai 4- Oi 2 X 2 + Cj 3 A 3 + . . . + ai n X n = 0, i = 1 , 2, 3,. . ., k, 
these k equations give k constraints to the system and it can be shown 
that X 2 + X$ + X* + . . . + X% now has a y 2 distribution with 
n — k degrees of freedom. 

16.7.2 Distribution of sample variance 

Let x u x 2 , x 3 , . . ., ^ be a random sample of « independent values 
from a normal population of variance cr 2 and let x be the sample mean. 
Then the sample variance s 2 is given by 

* 2 = [S(x f - x) 2 ]/n. 

Hence ^ = V^l£L 2 , 

<t 2 a 2 

and since (x, — x)/a is approximately jV( 0, 1), then ns 2 /a 2 has a y 2 
distribution with one constraint, for [E(x, — x)]/a = 0. Hence ns 2 1 a 2 
has a y 2 distribution with (n — 1 ) degrees of freedom. 

16.7.3 Short table of y 2 



I Significance level 

g 

5% 

1% 

15 

2500 

30-58 

10 

18-31 

23-21 

5 

11 07 

15-09 

1 

3-84 

6-63 


Table 16.4 
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Notice that if X = N(Q, 1), x \ = and 

P{\X\ > 1-96} = 5 per cent, 

P{X 2 > 1-96 2 } = 5 per cent 
P{X 2 > 3-84} = 5 per cent, 

P{xl > 3-84} = 5 per cent. 

Example A random sample of eleven independent measurements gave 
an unbiased estimate of s 2 equal to 9-65 mm 2 . Could the sample be 
reasonably regarded as drawn from a normal population with variance 
5-5 mm 2 ? 

Solution The N.H. is that the variance a 2 is 5-5. From the given data, 
ns' 2 1a 2 = 11 x 9-65/5-5 = 19-30, and from tables of % 2 with 10 degrees 
of freedom, 

P{ x 2 > 19-30} < 5 per cent. 

Hence the value found is significant and the null hypothesis is rejected. 
The sample is not regarded as drawn from a normal population with 
variance 5-5 mm 2 . 


or 

i.e. 

or 


16.7.4 Application of y 2 to goodness of fit 

Earlier in this chapter methods of testing parameters were shown on 
the assumption that the distributions selected were correct. The aim 
now is to test the form of the distribution assumed. On the assumption 
that a certain distribution is correct, it is possible to evaluate expected 
frequencies and compare these with observed frequencies. Let E u 

i = 1, 2, 3, . . ., k and O t , i = 1, 2, 3. k be the expected and 

observed frequencies for k possible outcomes of an experiment. Then 
the measure of discrepancy U say, between the observed and expected 
frequencies is chosen to be 


u = 2 


(Pi ~ E t ) 2 


If O and E differ, U will be large and it is necessary to have a significance 
test for U. For large numbers of repeats of the experiment, the distribu¬ 
tion of U approaches a % 2 distribution with k — 1 degrees of freedom. 


Example (i) A die is rolled 120 times. For an unbiased die each face 
occurs with the probability i so that each face is expected to occur 20 
times. The experiment gave the result in the following table which shows 
both observed and expected frequencies: 


Score 

1 

2 

3 

4 

5 

6 

Observed frequency 

15 

18 

30 

16 

24 

17 

Expected frequency 

20 

20 

20 

20 

20 

20 


Are the data consistent with the expected frequency ? 
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Solution The N.H. is that the die is unbiased. Then 


2 


(O - Ef 
E 


(—5) 2 
20 


= 8-50, 


(~2) 2 10 2 (-4)2 4_ 2 

20 + 20 + 20 + 20 


(-3)_ 2 

20 


and there are 6 — 1=5 degrees of freedom. 

From tables, xl = 11-07 at the 5 per cent level, and the observed 
value of 8-5 is less than 11-07 and the result is not significant. There is 
no reason to suspect the bias of the die. 

Example (ii) In a certain manufacturing process any one defective item 
occurs randomly in time and is independent of the time any other 
defective occurs. Records give the frequency of occurrence O z of x 
defects in one day as 

x I 0 1 2 3 4 5 

O x I 305 365 210 80 28 12 

Test the goodness of fit of a Poisson distribution whose mean is 1-2. 

Solution The N.H. is that the distribution is a Poisson distribution 
whose mean is 1-2. Expected frequencies E x are given by 

E x = [1 000 exp (—1-2)](1-2)7 jc!, 
and the values in the following table are calculated: 

X o, E, O-E (O - £) a (O - E) 2 IE 


0 305 301-2 3-8 

1 365 361-4 3-6 

2 210 216-8 -6-8 

3 80 86-7 -6-7 

4 28 26-0 2-0 

5 12 7-6 4-4 


14-44 0-047 94 

12-96 0-035 86 

46-24 0-213 28 

44-89 0-517 76 

4-00 0-153 84 

19-36 2-547 37 


Total 3-516 05 


Table 16.5 

Thus x 2 is 3-5 and the number of degrees of freedom is 6 — 1 or 5. 
From tables the 5 per cent level of f is 11-07 which is larger than the 
observed value of 3-5. Hence the result is not significant and there is 
no reason to reject the null hypothesis. That is, in this case, there is no 
reason to believe that the defects are occurring other than at random, 
and from this data there is no evidence of a fault in production. 

16.7.5 Modifications 

(i) When the population distribution is not known completely, and it 
is required to test the hypothesis that a sample is drawn from a specified 
distribution the parameters of which are not known, it is necessary to 
estimate the parameters using observed data from the sample. In this 
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case the resulting % 2 is tested as above but the number of degrees of 
freedom is reduced by the number of parameters estimated. That is, the 
number of degrees of freedom is given by 

(the number of classes) — 1 — (the number of parameters estimated). 


(ii) If the expected frequency E t in any one class is less than five, it is 
necessary to combine class results as follows, for example, 


Class 1 

Oi 5 

E, 2 


2 3 4 5 6 

6 13 18 17 16 

3 15 20 17 15 


The first and second classes are combined to give 

Class 1 and 2 3 4 5 6 

O t 11 13 18 17 16 

E, 5 15 20 17 15 


The number of degrees of freedom is now 

5 — 1 — (the number of parameters estimated). 


16.7.6 Contingency tables or test of independence 

Another useful application of the % 2 test occurs when one sample is 
classified according to two factors, and it is required to know whether 
these factors are independent. It is usual to represent the data in a 
contingency table. 

Example A group of 143 students is tested in mathematics and physics 
and graded «, /?, y in each subject. Use the % 2 test to test the hypothesis 
that the grades in mathematics and physics are independent. The con¬ 
tingency table for this is given in Table 16.6. 





Mathematics Grades (B) 



a 

V 

y 

Totals 

3 

a 

33 

13 

5 

51 

1 

2 

fi 

6 

49 

11 

66 

8 

y 

2 

3 

21 

26 

r 

Totals 

41 

65 

37 i 

143 


Table 16.6 

Before considering this particular example, consider a general con¬ 
tingency table containing r rows and s columns. Suppose factor A has 
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r levels A u A 2 , A 3 ,. . A r and factor B has s levels B u B 2 , B z ,. . ., B, 
and let there be n u observations in class (A { , Bj) and the contingency 
table is shown in Table 16.7. 




b 2 

b 3 

B, 

B, 

Totals 

Ai 

nu 

ni2 

n i3 

nu 

n u 

«i. = 2 ni, 

A. 

"2i 


«. 3 ■ 

n a) 

• • n 29 

n 2 . = 2 n 2 , 

a 3 

«31 


"33 ‘ 

.. n 3t . 

. . n 3 . 

n 3 . = 2 n 3 , 

A, 

- 

" ,a 

3 

n„ 

nt. 

«j. = 2 nu 

Ar 

n r i 

nr. 

- ■ 

n„ 

.. nr. 

n„ = 2 n r , 

Totals 

2 «u 

2 n, 2 

2n, a 

n.i 

2 nu 

2 /!„ 

- 


Table 16.7 


On the null hypothesis that the two factors A and B are independent, 
the probability that an observation falls in class (A„ B,) is the product 
of the probability that it falls in A t and the probability that it falls in 
B„ or p u = (/>,. p.j). Hence expected frequencies 

Ea = np t) = np t . p.j 
and x 2 = 2(0 - Ef/E 

(n lf - np t . p.,) 2 
if n Pi-Pi 

with rxs-1 degrees of freedom. 

In general p { . and p. t are unknown parameters and are estimated by 
p t . = proportion of observations in A, 

= fti./n, 

p.j = proportion of observations in Bj 
= n-j/n, 

where n t . = 2 n„ and n., = 2 n i/; and finally 

2_v v w< ‘ n, ilE ) 2 

X ff fii.n.j/n ' 


( 10 ) 
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Since j p,. = 1 and % p . y = 1, the total number of degrees of 

freedom is reduced by one for each parameter estimated so that there 
are r — 1 + s — 1 parameters estimated. Hence the number of degrees 
of freedom for the contingency table is 

rs - 1 - (r - 1) - {s - 1) = (r - 1)(* - 1). 

Hence testing independence in Table 16.6, 

£(a p a m ) = 51 X 41/143 = 14-62, 

E(<x p pJ = 51 X 65/143 = 23-18, 

£(a p y m ) = 51 X 37/143 = 13-20, etc. 
where a„ means a grade a in physics, etc., and 

, (33 - 14-62) 2 (13 - 23-18) 2 (5 - 13-20) 2 

x ~ 14-62 + 23-18 + 13-20 + ’ ' - 

(21 - 6-728) a 
+ 6-728 

= 106-52. 

Since x 2 = 9-49 for (3 — 1)(3 — 1) = 4 degrees of freedom at the 5 per 
cent level, this result is highly significant and the hypothesis of inde¬ 
pendence is rejected. 

Special case of 2x2 contingency table 

The x 2 test is an approximation which improves as the number of cells 
in the contingency table increases. In the special case of 2 X 2 tables 
(i.e. r = s = 2) the approximation is improved by using Yate’s continu¬ 
ity correction. The conventional notation for a 2 X 2 contingency table 
is shown in Table 16.8. 


Characteristic B 


Table 16.8 
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To apply Yates’ correction, decrease by £ those O > E and increase 
by \ those O < E and suppose the corrected table is 

a' b' n x 

c' d' n 2 

« 3 n 

Then it may be shown that an equivalent expression for is 
2 _ nja'd’ - b'c'Y 
X ~ n 1 n 2 n 3 n i 

Example It is thought that people who wear spectacles for reading have 
jobs which involve much reading. Two hundred people were investi¬ 
gated and the following table records the result. 

S Non-5 Totals 
R 74 31 105 

Non-jR 46 49 95 

Totals 120 80 200 

Test the independence of the two factors. 

Solution The N.H. is that the two factors are independent. 

E(R,S)= 105 x 120/200 = 63. 0> E, 
hence decrease 74 by 

E (R, Non-S) = 105 X 80/200 =42, O < E, 
hence increase 31 by 

E (Non-7?, 5) = 95 X 120/200 = 57, O < E, 
hence increase 46 by 

E (Non-/?, Non-5) = 95 x 80/200 = 38, O > E, 

hence decrease 49 by £. 

The corrected table is 

S Non -S Totals 

R 73-5 31-5 105 

Non-/? 46-5 48-5 95 

Totals 120 80 200 
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From the table, 


(73-5 X 48-5 - 46-5 x 31-5) 2 . 

120 x 80 x 105 x 95 X 20U 


From tables of y 2 , the 5 per cent level of % 2 with one degree of freedom 
is 3-84. 

The observed value of % 2 is 9-219 which is greater than 3-84, and the 
result is highly significant and there is no reason to suppose indepen¬ 
dence of the two factors in this group of people. 

16.7.7 Test of homogeneity 

The % 2 test may be used to determine whether several samples are 
drawn from the same population or populations having the same 
distribution. The precise nature of the distribution is unimportant. 

Suppose s different samples are drawn and a given characteristic 
observed for each one, and let there be r classes of this characteristic. Let 
n it be the frequency with which the it h class of the characteristic 
occurs in the y'th sample and the results be recorded in a table of the 
same form as Table 16.7, where B, is the y'th sample and A { is the ith 
class of characteristics. To test the null hypothesis that the s samples 
were drawn from populations having the same distribution, the method 
of evaluating y 2 is exactly the same as that of section 16.7.6 even 
though the mathematical model is different in that the column totals 
are not random but are the stated sample sizes. The number of degrees 
of freedom is now (r — 1)(at — 1) also. For, referring to the notation 
of Table 16.7, for the first sample, 

rW) = 11 2 i 

or (r — 1) degrees of freedom. 

For the y’th sample, 

Z 2 <nh sample) = I (»U ~ 
jth sample 

with (r — 1) degrees of freedom. 

Summing over ally = 1, 2, 3, . . ., s, 

* if n-iPi 
with s(r — 1) degrees of freedom. 
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If now p t , i — 1, 2, 3,. . ., r is estimated by the corresponding relative 
frequency n t .jn, 

r 2 _ v v ("» ~ n t .ln) 2 
X ff n.,n t .ln 

with s{r — 1 ) — (r — 1 ) degrees of freedom which is (r — l)(j — 1 ) 
degrees of freedom. 

Hence the procedure is: 

(i) find the expected frequencies E t] = n., n t .ln, 

(ii) calculate % a , and 

(iii) refer to the % 2 distribution with (r - l)(j - 1 ) degrees of freedom. 

Example The distributions of breaking tension levels of three different 
substances are as follows: 

| Sub I Sub II Sub in Totals 
Ti 140 161 133 434 

Ti 79 63 82 224 

T 3 31 36 25 92 

Totals 250 260 240 750 

Is the distribution of breaking tension level the same for the three 
substances? 

Solution The N.H. is that the breaking tension level is the same, i.e. 
all substances are from populations with the same distribution. The 
expected values are, 

E(I, 7\) = 250 x 434/750 
= 144-67, 

E(I, T 2 ) = 74-67, 

E(I, T 3 ) = 30-67 etc. 
and 

2 = 4-672 , 4 '33 2 0-33 2 4-442 

X ~ 144-67 + 74-67 + 30-67 + • ■ • + 29^44 
= 8-279. 

The number of degrees of freedom is given by (3 — 1)(3 — 1 ) = 4 . 
From tables, ■f is 9-49 which is greater than the observed value of 8-28^ 
and hence the result is not significant at the 5 per cent level. From the 
given data there is no reason to doubt the null hypothesis. 
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EXERCISE 16 

1 If x is normally distributed with mean [i = 10 and variance a 2 = 9, 
calculate the probability that (i) x > 11, and (ii) if a random sample 
of size 16 has mean x, that x > 11. 

2 A machine produces circular parts whose diameters are known to 
be normally distributed with standard deviation 0-019 m and the 
mean diameter is supposed to be 0-500 m. A random sample of ten 
is taken and the sample mean is found to be 0-488 m. Does the data 
from the sample indicate a change should be made in the machine? 

3 The mean range of a certain type of gun is 1 000 metre and the 
range standard deviation is 124 metre. After two years’ storage 
fifty rounds are fired and the mean range is found to be 872 metre. 
Test whether storage has changed the mean range. 

4 A firm claims that 88 per cent of the steel rods they produce have 
a breaking tension of more than 1 000 newton. A sample of 300 
rods was tested and only 249 rods had this breaking tension. Does 
this discredit the firm’s claim? 

5 The diameters of a set of ball bearings are assumed to be normally 
distributed with a standard deviation of 0-1 cm. What size sample 
should be taken in order that the estimate of the mean diameter of 
the ball bearings shall differ from the population mean by more 
than 0-005 cm only once in 100 times?. 

6 In a certain county in one year 90 boys entered school A and 90 
boys entered school B, after the results of tests in Mathematics and 
English. The mean of the marks of the boys entering school A was 
63 and that of the boys entering school B was 60. If the variance 
was 144 in both cases, is the claim that the entry requirements for 
both schools is the same, justified by these data? After one year the 
same boys were each given another common examination. In school 
A, 10 boys had left the school, so that 80 boys had an average mark 
of 61-5 with variance 900 and the 90 boys of school B had an 
average mark of 65 with variance 576. Can any conclusion be drawn 
about the superiority of school B1 

7 Two researchers X and Y each tested the growth of a sample of 
fifty plants of the same species over a fixed period of time. The 
following table shows their results: 



X 

r 

Mean growth in cm 

18-04 

19-38 

Variance in cm 2 

8-85 

9-12 


Does the smaller variance for X imply he is a better researcher than 
y? Determine whether the difference in the means is significant 
(i) at the 5 per cent level, (ii) at the 1 per cent level. 
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8 Two classes take the same test. Class A consists of six students and 
class B consists of eight students. The marks awarded are 

Class A I 830 829 827 828 826 826 

Class B I 828 823 824 827 824 826 824 823 

Test whether the mean marks of classes A and B are significantly 
different. 

9 Tests showed that of 100 women drivers and 200 men drivers, 30 
of the women and 75 of the men were not good drivers in that they 
failed the tests. Use these data to test the claim that women are not 
such good drivers as men. 

10 A sleeping drug and a harmless control tablet were tested in turn on 
nine patients in hospital and the number of hours’ sleep recorded 
as follows: 

Patient number 11 23456789 

Under sleeping drug 9-5 7-7 9-1 5-3 6-7 10-4 7'8 9 9 8-7 

Under control drug | 7-5 7-4 9-5 5-0 5-3 9-0 7-2 8 1 8-9 

Is the sleeping drug effective? 

11 Nine students were tested before and after being coached in mathe¬ 
matics and their marks were 

Student number 11 23456789 

Before coaching 55 61 63 47 76 45 48 60 62 

After coaching | 62 64 62 51 73 50 54 56 63 

Do these data show that the coaching was effective? 

12 The number of absences due to sickness in two factories over a 
period of six months was recorded as a percentage with the following 
result: 

Month I Oct. Nov. Dec. Jan. Feb. Mar. 

Factory A I 39 48 57 63 71 78 

Factory B \ 45 51 66 74 82 73 

Use the /-test to examine whether there is any significant difference 
in these two sets of data. 

13 Two machines are used to make identical rods. Random samples of 
ten parts from machine A and seven from machine B were measured, 
and the results recorded. 

Machine ,4 (mm) I 11 1 11-5 11-4 11-0 10-9 11 1 11-2 11-5 11-3 11-4 
Machines (mm) | 11-2 10-6 11-1 10-3 11-8 10-9 11-7 
Do these data indicate a significant difference in the mean lengths 
of the rods from the two machines? 

14 Capacitors of nominal value 100 pF are being produced and over 
a large number 15 per cent are rejected as being not of the required 
standard. Samples of 5 are taken at regular intervals and tested. 
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(i) Use the binomial expansion to calculate to three decimal places 
the probabilities of there being 0, 1, 2, 3, 4, 5 rejects in a sample. 

(ii) Calculate the probabilities of the various numbers of rejects 
using the Poisson approximation, (iii) Ten consecutive samples each 
of five capacitors contained 1, 2, 3, 0, 2, 3, 1, 3, 2, 2 rejects respec¬ 
tively. Use the probabilities calculated in (i) and (ii) to calculate 
two different values of f- to test whether the apparent increase in 
the percentage defective is significant. 

15 Test the goodness of fit of the following data from 500 trials of an 
experiment to a Poisson distribution whose estimated mean is 1-3. 

* 10 1 2 3 4 5 6 

Observed frequency | 132 180 121 51 10 5 2 

16 The cost per meal in canteens of different size, serving different 
numbers of meals per day, is listed in the following table. Do these 
data support the belief that larger canteens produce cheaper meals? 


Number of meals 
served 

Cost of meal in new pence 


under 30 30-40 

40-50 

over 50 

Totals 

100-499 

49 84 

74 

26 

233 

500-999 

40 77 

132 

59 

308 

Over 999 

43 76 

140 

103 

362 

Totals 

132 237 

346 

188 

903 


17 Test the % 2 goodness of fit of the distribution of problem 18, 
Exercise 15. 

18 Manual dexterity and success in theoretical examinations were 
graded over a period of years in applied science students. The 
following data were recorded. Test whether manual dexterity and 
success in examinations are independently distributed. 


Examination 

Manual dexterity 

a 

P 

V 

6 

w 

a 

1 369 

171 

1042 

399 

2 

P 

2 578 

475 

2 704 

933 

12 

y 

1 391 

421 

3 827 

1 843 

36 

s 

455 

256 

1 849 

2 507 

113 
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Applications 


17.0 INTRODUCTION 

In this chapter a number of physical problems are considered and 
mathematical equations or statements which attempt to describe the 
physical situation are derived. 

The aim is to indicate some of the ways in which the mathematics 
of the previous Chapters may feature as part of an engineering type 
problem. To take an example, the hyperbolic functions were introduced, 
by definition, in section 4.5 and their relationships with the trigono¬ 
metric functions shown in section 10.6 but the engineering student still 
may not appreciate the need to study such functions until some prac¬ 
tical use is described. Two illustrations of the way in which the hyper¬ 
bolic functions may feature are given in section 17.2.2 where they appear 
as the solution of a differential equation obtained from the model of 
a real situation. 

When an engineer is confronted with a problem it frequently happens 
that any attempt to describe the complete situation in mathematical 
terms leads to equations of too great a complexity. In this case the way 
forward is to replace the physical problem with a simpler model in 
which some of the variables have been ignored. The engineer may often 
be called on to use his skill and ‘know-how* to decide which variables 
should be retained and which omitted. The illustrations of the following 
sections may be looked upon as simple models of a more complicated 
real-life situation. For example, the spring systems of section 17.3.3 are 
idealised in that, first the damping/frictional effects are not taken into 
account, and second, the stiffness of each spring is presumed constant 
whereas it is possible for the stiffness to depend on temperature and 
therefore on the rate at which the displacements take place. These 
additional factors (and others) when taken into account lead to 
mathematical equations which would be excessively complicated and 
would obscure the basic fact that the eigenvalues can be related to the 
natural frequencies of oscillation. With this fact well-established the 
equations could be modified to take the additional factors into 
account. 

From a mathematical standpoint the solution of the mathematical 
equations is the solution of the problem, but from an engineer’s stand¬ 
point this solution is only one step towards solving the physical 
problem. The engineer must be able to interpret the mathematical 
solution in terms of the physical variables remembering that the 
mathematical one may have been derived from what was only a model 
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of the real situation. The mathematical solution may sometimes be 
used to assess the justification of the omission of certain variables. 

The problems considered here are simply a selection from a much 
wider range of problems of the type that the engineer could encounter 
both in his academic course and in practice. 

17.1 INEQUALITIES. LINEAR PROGRAMMING 

The engineering world is concerned very much with inequalities. It is rare 
that a measurement is precise, generally a variable is found to lie 
within a band of values and the symbols < or > are of frequent 
occurrence. The student should know the difference between equality 
and inequality and in certain circumstances this may more readily be 
observed in a graphical sense. For example the equation 

2y + 3x = 6 

may be represented in (x, j) space as a straight line of slope (—f) 
having intersects of lengths 2 and 3 on the .v and y axes, respectively 
(see Fig. 17.1). 



The equation 

2y + 3x < 6 

however, represents a region of the ( x , y) space and is shown by the 
unshaded region of Fig. 17.2. 

Likewise the shaded region is given by 2y + 3x > 6. 

A simple use of such graphical regions to represent inequalities 
equations and their use in solving a management problem is now given. 

Example A factory is to be designed to accommodate two types of 
machines to make a particular product and the manager has to 




APPLICATIONS 


465 



decide how many of each type to install. The specification is as 
follows: 



Floor area 

Operators 

Profit units 

Type A 

4000 

11 

9 

Type B 

7000 

8 

12 


The maximum factory area available is 56,000 and there are 88 
operators available to make the product on either machine. The factory 
must make a profit of not less than 90. How many of each type of 
machines should be used 

(a) to obtain the maximum profit? 

(b) to make the minimum number of redundancies? 

Solution Suppose a machines of type A and y machines of type B are 
installed then the following inequalities arise: 

Area 4a + 7y < 56 

Employment 11a + %y < 88 

a > 0, y > 0. 

If P is the profit then 

P = 9a + \2y > 90. 

Representing the various inequalities in (a, y) space and shading 
the unacceptable regions leads to Fig. 17.3. The unshaded region is the 
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‘operational zone’ and there are five operational ‘points’ within the 
zone, i.e. (1, 7), (2, 6), (3, 6), (4, 5), (5, 4) 



(a) the line 9 a + I2y = P is parallel to 9x + I2y = 90 and the 
profit P will be maximum when the perpendicular distance from the 
origin to the line 

P = 9x + 12 y 

is greatest. Naturally it is necessary for the line to pass through one 
of the operational points (x,y must be integers) and Fig. 17.3 shows that 
the line must pass through the point (3, 6). Under this condition 

* = 3, y = 6 
Area = 54,000 

Employed persons = 33 + 48 = 81 
Profit = 27 + 72 = 99 

There are thus, 2,000 units of area not used and 7 operators cannot be 
employed. 

(b) The minimum number of redundancies will appear when the line 

P = 9x+ 12 y 

parallel to 9 a + 12 y = 90 in the operational zone, passes through the 
operational point nearest to the line 1 1a + 8j = 88. 
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From Fig. 17.3 this is observed to be the point (5, 4). Under these 
conditions 

x = 5, y = 4 
Area = 48,000 
Employed persons = 87, 

Profit = 93 

Thus only one person is declared redundant (or is promoted) but 8,000 
units of area are unused and the profit is reduced to 93. 

Other possibilities appear on examining the Figure. One obvious 
possibility would be to operate at the point (5, 5) within the profit/ 
area zone but outside the labour zone. This leads to 

Area = 55,000 
Operators = 95 
Profit = 105 

This operational point would utilise practically all the area available, 
give a larger profit but would require the recruitment of 7 more machine 
operators. Whether this is a possibility would depend on other factors 
outwith the present consideration which may in turn lead to a further 
problem of this type. 

This type of problem which leads to a mathematical model as a set of 
m linear equations with n unknowns together with a target function 
which requires maximising or minimising is called a linear programming 
problem. When the number of unknowns is greater than 2 the graphical 
approach is impossible (with 3 unknowns the linear equations are 
represented by planes in 3-dimensional space and with 4 unknowns 
there is no graphical representation at all) and it is necessary to resort 
to other techniques. There are many techniques available with the 
general heading of linear programming and they usually start by setting¬ 
up the mathematical equation in matrix form. If a maximisation of some 
function is required, this is often achieved by iteration i.e. start with an 
initial estimate of the variables and then use some process repetitively 
to improve the estimate until eventually the optimal values are reached. 

The applications of linear programming in engineering are many 
and varied. A distribution problem may be such that different sources 
S u S 2 , S 3 , . . . hold stocks P u P 2 , P 3 . . . and supply a number of 

different consumers C u C 2 , C 3 . . . whose demands are D u D 2 , D 3 - 

In this case the cost of supplying consumer C, from source .S', may be 
written A if and the matrix (A tj ) is the cost matrix. The problem is to 
meet the demands of the consumers in the most economical way, i.e. 
to keep costs to a minimum. Such distribution type problems may be 
met where there is supply and demand of materials (raw or manu¬ 
factured) or services (electricity, gas, water). 
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17.2 EXPONENTIAL FUNCTION. HYPERBOLIC FUNCTIONS 
DIFFERENTIAL EQUATIONS 

The exponential function e kx was introduced in Chapter 4 as the solution 
of a particular differential equation 



This is an important differential equation which makes its appearance 
in many different contexts. 

17.2.1 Natural law of growth. The exponential function 

A variable is said to satisfy this law of growth if its rate of change is 
proportional to the value of the variable, i.e. if y is the variable, then 


Some simple ways in which this law appears will be illustrated. 

(i) Under certain conditions, populations of animals, people, or 
bacteria satisfy this law, i.e. that the rate of change of a population is pro¬ 
portional to the population. With a positive constant of proportionality 
k the population is then seen to have exponential growth y = A e kt , 
where A is constant. The shape of the curve of y is similar to the shape 
of Fig. 4.1 and increases rapidly for t > 0. If this represents the behav¬ 
iour of the world human population it gives good indication of the 
rapidity of the increase of population with time. Fortunately, there are 
indications that the growth law of human population is governed by 
an equation of the form dyjdt = ky(c — y), where c is a constant. This 
differential equation may be solved using the method of section 11.1.2 
and has the solution 


y = Acj {A + e _cfc( ) 


where A is an arbitrary constant. In this case the population has a 
limiting value given by 


Limit [Ac/(A + e -fcct )] 

= Ac/A = c. 

This population growth case is illustrated in Fig. 17.4. 

(ii) If a hot body at temperature T is allowed to cool towards the 
constant temperature T 0 of its surroundings, the law of cooling (as 
propounded by Newton) states that the rate of change of temperature 
is proportional to the temperature excess, i.e. 
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where A: is a positive constant dependent on the material of the body and 
its surrounding. Replacing T — T 0 by 0 then 



leads to 6 = T — T 0 = Ae~ kl . 

The cooling curve for a body of initial temperature T x is illustrated 
in Fig. 17.5. 



Fig. 17.5 
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(iii) Suppose a rope is passed round a rough circular bollard and 
forces applied at each end of the rope. The tension at any point of the 
rope may be evaluated as follows. Refer to Fig. 17.6 and consider the 



equilibrium of forces acting on the small element AB of the rope. The 
forces acting are T at point A, T + 6T at point B, overall reaction of 
the bollard R, and frictional force F. Hence for the small element of 
rope AB 

(T + dT) sin {$60) + T sin {$60) = R 
{T + ST) cos {$60) - T cos {$60) = F. 

In the case of limiting friction (i.e. where the rope is about to slip) 
then F = [xR where (i is the coefficient of friction between rope and 
bollard. 

Replacing the trigonometric functions by the first term of their 
Maclaurin expansions, i.e. sin $60 $60, cos $60 — 1, and neglecting 

products of small quantities leads to 

6T^=F^fi(2T+ 6T){$60) 

== =ftT60 

so that 6T/60 — fiT and, in the limit as 60, 6T-+ 0, 



This is the natural law of growth and so the solution is T = A e" 8 . 
Now, T = P 0 when 0 = 0, and T = P 1 when 0 = <j>, and hence 

P 1 = P 0 e"*. 
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If two turns are taken around the bollard then <j> — 4n and, if the coeffi¬ 
cient of friction is taken as j, then P 1 = P 0 e 14 ^ = P 0 e Tr , i.e. P± — 23 P 0 . 
Thus for 2 turns on the bollard a pull at one end will sustain a tension 
23 times as large at the other. This is the basis for the ship’s capstan 
and the band-brake. 

17.2.2 The hyperbolic function 

The hyperbolic functions were introduced in Section 4.5 of Chapter 4 
and used in deriving the solution of some differential equations in 
Chapter 6, the relationship between the trigonometric and the hyper¬ 
bolic function was shown in Section 10.6 and it is because of the con¬ 
nection with the well-known sine and cosine that the basic hyperbolic 
functions are called hyperbolic sine and hyperbolic cosine. The con¬ 
nection with the hyperbola is not quite so obvious. 

The curve of intersection of a right-circular cone with a plane is 
called a conic section. The three basic sections are the ellipse, the para¬ 
bola, and the hyperbola. In Fig. 17.7, the three types of intersection 
are shown: 





(a) (b) (c) 

Fig. 17.7 


Fig. 17.7(a) shows the section when the plane intersects the cone in a 
closed curve. This section is called the ellipse. Fig. 17.7(b) shows the 
situation when the plane is parallel to the side of the cone giving an 
open curve called the parabola. The Fig. 17.7(c) shows the intersection 
curve that has two branches which together form the hyperbola. 
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It is shown in books of analytical geometry that the ellipse, referred 
to its symmetrical axes (see Fig. 17.8(a)), has an equation 



Fig. 17.8 

whilst the hyperbola, Fig. 17.8(b), has the equation 



The two lines AB and CD towards which the hyperbola approaches at 
infinity are called asymptotes and have equations y = ±(b/a)x. 

When b = a, the ellipse has the equation x 2 + y 2 = a 2 and the 
particular symmetric shape is a circle. The equivalent hyperbola (with 
b = a) has the equation x 2 — y 2 = a 2 and is called a rectangular 
hyperbola (i.e. one whose asymptotes, of equations y — ± x, cut at 
right angles). 

It is well known that the circle can be represented by its parametric 
equations x = a cos Q, y = a sin 6, since cos 2 0 + sin 2 0 =1, and for 
this reason the trigonometric functions sin 0 and cos 0 are sometimes 
called the circular functions. In like fashion the parametric equations of 
the rectangular hyperbola are x — a cosh <j>, y = a sinh <f>, since 
cosh 2 </> — sinh 2 </> = 1. The two functions sinh<£ and cosh</> are then 
called hyperbolic functions. 

The hyperbolic functions are obviously important from a geometrical 
and mathematical viewpoint but they are important functions that also 
occur in a natural way as the following illustrations show. 

(i) The catenary 

If a heavy chain is suspended under gravity between two points, the 
curve that the chain forms is called a catenary, e.g. overhead electricity 
cable, or telegraph wire. 
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Fig. 17.9 


Let i be a distance measurement along the chain and consider the 
equilibrium of a small portion 6s, i.e. AB of the chain. The three forces 
acting are the weight ads, where a is the weight per unit length, and T, 
and T + ST, the tensions at the ends A and B. Resolving forces on 
AB, horizontally and vertically, 

( T + ST) cos(0 + 66) - T cos0 = 0, 

(T + 6T) sin(0 + 66) — T sin0 = ads 

where tan 0 is the slope of the catenary at A. Expanding sin(0 + 00) 
and cos(0 + 00) and using 

sin 00 — 00 
cos 00 ^ 1 

these two equations reduce to 

0Tcos0 - Tsin0 00 = 0 
6T sin0 + T cos0 66 = ads 

after neglecting products of small quantities. 

Proceeding to the limit as 66, 6T^-0 leads to two differential 
equations 

sin0 ^ = —T cos0 + a ^ 
da dv 

The first of these differential equations can be integrated immediately 
(see Section 11.1.2) since the variable are separable and gives 

T cos0 = constant = T 0 (say) 
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Elimination of dTjdd and T in the second differential equation leads to 



This differential equation can be expressed in terms of x and y (see 
Fig. 17.9) by using the fact that 

dyjdx = tanO 


from which it follows that 


% = 7x (x »*>-*>*>% 


d 2 y _ a ds 
ie> d? = ¥ 0 Tx 

Writing a/T 0 = Ijc and using 

dsjdx = seed = Vl + tan 2 0 = Vl + {dyjdx) 2 
the differential equation for y becomes 

d 2 v 1 __ 

;£3 = c Vl + ( d yl dx ) 2 - 


This may be solved by putting p = dyjdx, as in Section 11.2.1, i.e. 


£ 

dx 


= l ~Vl +p 2 


(Jl _If* 

JVl+p 2 cj 
sinh - 1 (p) = xjc + A 
p = sinh (A + xjc). 


Suppose that the j-axis of Fig. 17.9 is now made to pass through the 
lowest point of the catenary then x = 0 when p = dyjdx = 0, i.e. 
A = 0 and hence 

p = dy/dx = sinh {xjc). 

Integrating again, 

y = c cosh {xjc) + B 
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Locate the x-axis at a depth c below the lowest point of the catenary, 
i.e. y = c, x = 0, then B = 0. Hence y = c cosh x/c is the equation 
of the catenary curve with axes as shown in Fig. 17.10. 



Thus the catenary is the shape of the hyperbolic cosine curve. This shape 
is frequently seen in practice, e.g. telegraph wires, electricity power 
cables. 

The length j of the curve may be found by integration. From 

it follows that 


s -j7' + (£)’* 

= j J l + |sinh ^ dx = J cosh (x/c) dx. 

Hence s = c sinh x/c + D. 

Measuring s from the lowest point, s = 0 when x = 0 hence D = 0 
and s — c sinh x/c. 

Thus the length of the catenary y — cosh x/c from its lowest point is 
the hyperbolic sine function s = c sinh x/c. 

(ii) The cooling fin 

In many physical situations it is necessary to extract the heat from a 
hot liquid (or gas), e.g. a heat exchanger, a cooling system. In the 
simplest situation the hot container simply radiates heat into the cold 
surroundings (the latter being maintained by a flow of cold air or other 
medium). The container is frequently a metal pipe and in order to 
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increase its surface area it is provided with cooling fins. A typical pipe 
is shown in Fig. 17.11. 



The cooling fin may take a variety of different shapes the cross-section 
being rectangular, triangular, or parabolic. Consider the case of a fin 
of constant rectangular cross section and the flow of heat through it. 
Fig. 17.12 shows the typical cross-section. Let 7" be the temperature at 



distance a along the fin and q n be the quantity of heat flowing over the 
cross-section of area A, let q n + dq n be the heat flowing over the cross- 
section of the fin at distance x + dx and let 8q e be the heat flowing into 
the cooling medium from the section of thickness 5x. Let / be the length 
of the fin. The heat balance equation over the section of width bx at 
distance x is 


q n = (q n + bq n + dq c ) 
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Now, q n is a function of x only and hence (see equation (7) of section 
5.1.3), 


and it follows that 



There are two basic empirical laws of heat transfer: 
(i) the law of conduction which states 




A 


where K is a constant dependent on the thermal properties of the fin 
material and, 

(ii) the law of convection (Newton’s law of cooling), which states 
6q c = h(T - T s ) (Cdx) 

where h is a constant dependent on the surface properties of the fin and 
on the cooling medium; T s is the (presumed constant) temperature of 
the cooling medium and C is the circumference of the fin at distance x. 

Thus, K T-T.)CI^-^Sx=+K^(A d f)d X . 


Normally A and C are dependent on the distance jc but if the fin is a 
rectangular one they are both constants and in the limit, as dx -> 0, 
the equation becomes 


<PT 
dx 2 


-AT- T t ) 


The assumption that T s is constant implies that the heat is transferred 
to the surrounding medium and immediately transported elsewhere. 
In practice something akin to this idealised situation is achieved by a 
rapid flow of cooling medium over a hot fin. For example, the sump in 
certain motor cars is often of ribbed shape approximating to a series 
of fins and the surrounding cooling air is usually moving past the fin 
rapidly. 

It is convenient to replace T — T s by 0 and hence reduce the equation 
to 



where k = •%/( hC/KA) is a constant. 
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The differential equation is easily solved (see Section 11.4 with 
k, (i = +k, — k ) and 6 is given by 

6 = Dt kI + Ee- kx 


where D and E are arbitrary constants of integration. 

In order to evaluate these arbitrary constants it is necessary to fit 
boundary conditions to the problem. If the temperature T at the base of 
the fin is taken as T 0 then one boundary condition has the form 
6 = T 0 — T s = 0 o , at x = 0. 

A second boundary condition may be obtained by neglecting the heat 
loss at the extremity of the fin x = /, i.e. by assuming that (q n ) x =, is 
zero. Use of the law of conduction (see (i) above) gives the second 
boundary condition in the form 


dT 

dx 


l. 


Insertion of these two boundary conditions into the equation for 6 gives 
6 0 = D + E 
0 = D e kl — Ee~ kl 

Solving for D and E and using the fact that 
e* q. e -<o = 2 cosh <f> 

quickly leads to 

0 = 6 0 cosh k(l — x)/cosh kl 
i.e. T-T s = (T 0 - T s ) cosh k(l - *)/cosh kl. 

The variation of temperature across the fin length thus follows an 
hyperbolic cosine curve as in Fig. 17.13. 



Fig. 17.13 


oc=l cc 
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Fins of shapes other than rectangular may be considered in a similar 
manner, the differential equations so arising being of additional 
complexity. It may be shown that the ideal shape (for optimum heat 
transfer) is of a parabolic cross-section and this is the shape frequently 
seen on practical fins. 

17.3 MATRICES. DETERMINANTS. EIGENVALUES AND 
EIGENVECTORS 

Engineering situations, when described by a mathematical model, 
frequently lead to a set of algebraical or differential equations in a 
specified number of unknown quantities. Matrix algebra may often be 
used to simplify the sometimes complex array of information. 

17.3.1 A simple electrical network 

Suppose an electrical network consists of a number of resistors together 
with batteries of given electromotive force (e.m.f.). It is required to 
find the current flowing in the various branches of the network. The 
Fig. 17.14 illustrates such a circuit the resistor values are in ohms and 
the e.m.f. in volts. 



Let the current (in amperes) be represented by cyclic currents I lt I 2 , 
and / 3 flowing in the three branches of the circuit, e.g. the current 
flowing in the resistor of 5 ohms is (/j — / 2 ). 

The basic Kirchoff laws that govern the network are: 

(i) for each closed circuit the total, algebraic sum of e.m.f. is zero, 
and 

(ii) at each nodal point there is continuity of current. 
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The law (ii) is automatically satisfied by using cyclic currents, for 
example at modal point A the current flow is 



which shows that the current flow into the node is exactly balanced by 
the current flow out. 

The law (i) leads to the following set of algebraical equations 
10 = 6(4 - / 3 ) + 5(4 - 4 ) + 34 
3 = 74 + 5(4 - 4 ) + 2(4 - 4 ) 

5 = 2(4 - 4 ) + 6(4 - 4 ) + 104 

This set of equations reduces to the following 
144 - 54 - 64 = 10 
-54 + 144 - 24 = 3 
-64 - 24 + 184 = 5 

or in matrix form 



This is of the form 

RI = E 

where R is the resistance matrix, 

I is the current matrix, 
and E is the emf matrix. 

Note that the matrix equation E — RI is the generalised form of the 
the first Kirchoff law. 

To find the current, the matrix equation is solved for / either by using 
the inverse matrix R 1 so that 

/ = R'E 

or by employing Gaussian elimination. 

17.3.2 Forces in a frame network 

Suppose Fig. 17.15 represents a simple framework structure with 
hinged joints. It is assumed that there is no bending in any member 
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and that the external forces are applied at the joints. It is required to 
find the reactions at the supports and the bar tension (or compression) 
forces. 

Framework of this type are of common occurrence in the analysis 
of structures (e.g. girder bridges, cranes, skeleton of buildings) and are 
usually called trusses. 



Fig. 17.15 


In the framework of Fig. 17.15 there is an external force applied at 
P having a component 50kN acting in the negative z-direction and 
another component 100 kN in the ^-direction. The other numbers 
represent the lengths in metres and the bar members of the truss are 
labelled (1), (2) and (3). 

There are three reactions at each of the supports and each reaction 
has three components in the x, y and z-directions. In addition there are 
three tension forces along the three bars. In all this gives twelve un¬ 
known forces (3x3 + 3), and the appropriate number of equations 
is obtained by balancing force components at each joint giving 4 x 3 = 
12 equations. 

Let U Fin Fig. 17.6 represent a typical bar member with an (assumed) 
tensile force F (i.e. internal bar force directed away from the joint). Let 
the forces have components (P 1 P 2 P 3 ) and (R 1 R 2 R 3 ) respectively. 
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Balancing the forces at joint U gives 

Px + F cos « = 0 
P 2 + Fcos /S = 0 
P 3 + Fcos y = 0 

where (cos a, cos /?, cos y) are the direction cosines of the line UV. In 
structural engineering it is usual to work in terms of F/l where / is the 
length of the bar UV and this ratio, denoted by T, is called a tension 
coefficient. Thus at joint U the force balance equations become 

Px + Ta = 0 
P 2 + Tb = 0 
P 3 + Tc = 0 

where a, b, and c are the projections of UV onto the axes of x, y, and z 
respectively. The actual bar force F is then found by multiplying T by 
the length of the bar. 

The Fig. 17.17(a), (b) show the plan and elevation force diagram for 
the truss of Fig. 17.15. The tension coefficients are 71, 71, and T 3 and 
the reaction force components are {PxP 2 Pz) at A ; {RxR 2 R 3 ) at B, and 
(SxS 2 S 3 ) at C respectively. The external forces are indicated and the 
other numbers on the Fig. 17.17 indicate the lengths. 

Balancing forces at each joint in turn leads to the following equations: 

at A, Px - 5Tx = 0 

P 2 + 1071 = 0 
P 3 + 3071 = 0, 
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at B, Ri + 10 T 2 = 0 

R 2 + =0 

R 3 + 3071 = 0, 
at C, S x + 10 T a = 0 


S 3 + 3071 = 0, 

at P, 571 - 10 T 2 - 10r 3 = 0 

ioo - 1071 + ior 3 = o 

-50 - 3071 - 3071 - 3071 = 0, 


| 50 kN 



\(3) 

! 

A. T 

] 100 kN 

10 

i 

(2) T 2 X 

d P 

R, 


10 

! s\ 5 


Fig. 17.17 


Naturally, with this simple truss it is possible to solve without resort 
to matrix analysis but it should be realised that the force balance 
equations of much more complicated networks can be derived by using 
this simple technique. When the number of unknowns (and equations) 
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is large it is often an advantage to set up a matrix. The set of 12 equations 
may be represented by the matrix equation 


M -5 \ 

lPx\ 


1 0 \ 

1 10 

P 2 


0 1 

1 30 

P 3 


0 

1 10 

R 1 


0 

1 

r 2 


0 

1 30 

r 3 


0 

1 10 

s i 


0 

1 -10 

' s 2 


0 

1 30 

S 3 I 


1 0 

5 -10 -10 

Tx 


0 

10 -10 

T 2 

\ 100 

\ 30 30 301 

\ t 3 I 

\ — 50 


In this form it is easy to reduce the set of equations by Gaussian 
elimination (see Section 8.6.1). The solution is found to be 

T 3 = -100/9, T 2 = 95/9, 7\ = -10/9, S 3 = 3000/9, 

S 2 = - 100/9, S x = 1000/9, R 3 = -2850/9, R 2 = 0 

R x = -950/9, P 3 = 300/9, P 2 = 100/9, P x = -50/9. 

The actual bar forces are 

Fi = - j X VT025, F 2 = jX VIooo, 

and F 3 = — 100/9 x V1100. The negative signs of F y and F 3 show 
that the forces in bars (1) and (3) are compressive. The signs (as well 
as the magnitude) are very important because it is much more difficult 
to construct a bar to sustain a compressive force than to sustain a 
tension force. Hence in this structure particular attention must be given 
to bars (1) and (3) or alternatively the structure should be redesigned 
in order to ensure that all bar forces are of a tensile nature. 

17.3.3 Vibration network. Eigenvalues 
A. Mechanical spring system 

The equation of motion of the simple mass/spring system of Fig. 17.18 
using the notation of the Figure, in which l 0 is the unstretched length 



APPLICATIONS 


485 


of the spring, is given by expressing Newton’s second law of motion 
(for constant mass) in the form Force = mass x acceleration. 


The minus sign indicates that the spring force T is a restoring force. 

For a simple spring, Hooke’s Law (an empirical law) states that the 
tension T is proportional to the extension from its unstretched length, 
i.e. T = kx where k is a spring constant called the stiffness. (Note. This 
is sometimes expressed in the form T = Xxjl 0 where X is another 
constant called the modulus of elasticity). Eliminating T leads to the 
differential equation 


d 2 x 

n^- 2 +kx = 0 


or ^2 + n 2 x = 0, n 2 = k/m 

The solution of this differential equation is 
x = A sin (nt + a) 

where A and a are two arbitrary constants to be determined using the 
initial conditions (i.e. the conditions under which the motion is 
started). The important point here, however, is that the vibration of the 
mass m is sinusoidal and the angular frequency of the oscillation (the 
pulsatance ) is given by n. The period of the oscillation is 2 tt/« and the 
actual frequency is n/2n. 

Consider now a number of such spring networks linked together. 
The vibration in one mass will cause sympathetic vibrations in the 
other masses, i.e. there will be an interaction between the individual 
networks. For simplicity consider three such spring networks linked 
together as in Fig. 17.19(a) in which the three masses are m u m 2 and m 3 
and the connecting springs are of stiffnesses k lt k 2 , k 3 and fc 4 . The 
system is fixed at each end and the masses vibrate longitudinally. 
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The Fig. 17.19(b) shows the system in motion. The displacements x t , 
x 2 and x 3 of the masses are measured from the static equilibrium 
positions and to simplify the subsequent analysis it is assumed that in 
static equilibrium the springs are in their unstretched state. The 
equations of motion of the three masses then take the form 

-7i + T 2 = fflj d 2 xjdt 2 
-T 2 + T 3 = m 2 d 2 x 2 /dt 2 
-T 3 + T i = m 3 d 2 x 3 /dt 2 

where T u T 2 , T 3 , and T± are the tensions in the springs. 

Hooke’s law expresses the tension forces as 

7i = k x (x i — 0); T 2 = k 2 (x 2 — x x ); T 3 — k 3 (x 3 — x 2 ) 
r 4 = ki(0 - x 3 ) 

Eliminating 7), T 2 and T a leads to three differential equations given by 
m x x 3 = —{k 1 + k^Xi, + k 2 x 2 
m 2 x 2 = k 2 x 3 — (k 2 + k 3 )x 2 + k 3 x 3 
m 3 x 3 = k 3 x 2 — (k 3 + ki)x 3 
where x means d 2 xldt 2 . 

Now look for that particular mode of oscillation in which all masses 
oscillate at the same frequency «/2 tt but with differing amplitudes. 
This particular, oscillation is called the normal mode. 

This assumes that 

x x = A 3 sin (nt + a) 
x 2 = A 2 sin (nt + a) 


x 3 = A 3 sin (nt + a) 


and means that whatever configuration the system may have at some 
time t 0 it returns to the same configuration after a period of In/n, i.e. at 
time t 0 + 2tt/h. Inserting the assumed values for x u x 2 and x 3 into the 
differential equations leads to the set of algebraical equations in A u A 2 
and A 3 


—n 2 A t 


k 3 

m 3 


A 2 


(J^±Ma 3 

m 3 


In matrix form this set of equations may be written 
CA = IA 
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where C is the coefficient matrix, A is the amplitude matrix and A = n 2 , 
i.e. 



Note that A is the form of a vector and is often called the amplitude 
vector A. The equation CA = AA is of the form of (84) of Section 
9.11, i.e. A = n 2 is the eigenvalue of the matrix C and to each eigenvalue 
there will correspond a value of the frequency n. Thus for this system 
there will, in general, be three different modes of oscillation, i.e. three 
natural frequencies of oscillation of the system given by y/X. Each mode 
of oscillation will give a different eigenvector which is the amplitude 
vector A. 

In order to examine such a system in detail consider the above system 
with 

m 1 = 6, m 2 = m 3 = 4 

= k 2 = k 3 = 3, and = 1. 

The coefficient matrix C is then 


/ 1 -1/2 0\ 

C = I -3/4 6/4 -3/4 

\ 0 -3/4 1 / 


and the characteristic equation for X takes the determinantal form 


1 — X -1/2 0 

-3/4 6/4-A -3/4 
0 -3/4 1 - A 


2 - 2A -1 0 

or -3 6 - 4A -3 = 0 

0 -3 4 - 4A 


This equation reduces after the usual determinant evaluation and 
subsequent factorisation to 

(A - 1)(4A - 9)(4A - 1) = 0 
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The eigenvalues are thus 

A x = 1/4, A 2 = 1, A 3 = 9/4, 
and the associated non-normalised 
for /.! = 1/4, 

A (1) = 

for A 2 = 1, 

A (2, = 

for A 3 = 9/4, 

A <3) = 


When the system is oscillating in its first normal mode A = A x = 1/4, 
i.e. at its least natural angular frequency n 3 = \ the amplitude of the 
displacements are in the ratio [2:3:3] and so the three displacements 
x u x 2 , and x 3 may be written 

Xi = 2 a sin (4/ + a) 
x 2 = 3a sin (\t + a) 
x 3 = 3a sin (it + a) 

where the values of a and a are to be determined by using the initial 
conditions. The period is 4ir and is the greatest of the three normal 
modes. The motion is illustrated in Fig. 17.20(a) which shows the case 
for a = 0. 

When the system oscillates in its second mode with A = 1, i.e. at a 
natural angular frequency n 2 = 1, the displacements are given 
x x — — b sin (t + /?) 
x 2 = 0 

x 3 = ~b sin (t + £). 

In this mode the centre mass is at rest and the other two execute 
simple harmonic motion at a period of 2-it. The values of b and fS are 
again determinable from the initial conditions. This mode is illustrated 
in Fig. 17.20(b) in which ft is taken as zero. 

Finally, when the system oscillates in its mode of highest frequency, 
i.e. A = 9/4, n 3 = 3/2, the displacements are given by 

x 3 = 2c sin (3r/2 + y) 
x 2 = —5c sin (3r/2 + y) 
x 3 = 3c sin (3//2 + y). 


eigenvectors are as follows: 




APPLICATIONS 


489 


The period of the oscillation is 47 t/ 3 and the constants c and y are 
again determined from the initial conditions. This motion is illustrated 
in Fig. 17.20(c) (for y = 0) and shows that the outer masses oscillate 
in phase but of differing amplitudes whilst the inner mass oscillates 
with a large amplitude and exactly 180° out of phase. 



These motions are, naturally, subject to any physical limitations of the 
springs and assume that the springs can be compressed as well as 
extended. 

If the system is set in motion in an arbitrary manner (e.g. by initial 
displacement and/or velocities of the masses) then the subsequent 
displacements x t will be a linear combination of the three normal mode 
oscillations. The three displacements at time f will then be given by 

x 1 = 2 a sin Qt + a ) + b sin (t + ft) + 2c sin (If + y) 

x 2 = 3a sin (|f + a) - 5c sin (If + y) 

x 3 = 3a sin (if + a) - b sin (f + ft) + 3c sin (If + y) 

The six unknowns a, b, c, a, ft, y will be determined by six initial 
conditions (i.e. will be determined once the initial motion is prescribed). 
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For example if the system is started into motion by displacing the first 
mass a distance h, keeping the other two masses in their equilibrium 
positions, and releasing it from rest, the initial conditions would be 

jci = h, x 2 = 0 = x 3 , = x 2 = x 3 — 0 at t - 0. 

These six conditions applied to the above set of equations lead to the 
six equations 

h = 2a sin a + b sin (S + 2c sin y 
0 = 3a sin a — 5c sin y 

0 = 3a sin a — b sin /f + 3c sin y 
0 — a cos ol + b cos /? + 3c cos y 
0 = f a cos a — t^c cos y 

0 = f a cos a — b cos /? + f c cos y 
from which it follows that 

a = HIS, b = 3 A/5, c = 3A/40, a = 0 = y = ir/2. 

B Electrical networks. Filters 

The governing equations of a simple electrical circuit connecting an 
inductance L, and a capacitance C as in Fig. 17.21 are given by 



Fig. 17.21 


where Q is the charge on the condenser at time t and I is the current 
flowing in the circuit. 

Eliminating I, gives 

Q + n 2 Q = 0, n 2 = 1/EC 
where Q means d 2 Q/dt 2 . 

The solution of this differential equation is Q = A sin (nt + «) and 
leads to I = nA cos {nt + a). 
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It may be observed that the L — C electrical circuit is the analogue 
of the mass-spring mechanical circuit. The differential equations are 
identical and it is possible to draw analogies between the components 
of the two circuits as in the following table. 


Electrical 

Mechanical 

Inductance, L 

Mass m 

(Capacitance) -1 , 1/C 

Stiffness k 

Charge Q 

Displacement jc 

Current / = Q 

Velocity v = x 

Applied emf E 

Applied force F 

Resistance R 

Damping p 


In both the mechanical circuit and the electrical circuit discussed in 
this section the effect of damping (i.e. resistance or friction) is omitted. 
Also the driving forces of emf or applied force are also not considered. 
Both of these effects can however be taken into account. In a real 
engineering situation it is unlikely for example, that damping (resistive 
or frictional) will be absent. Nevertheless before these additional 
effects are included in the analysis, it is extremely useful to have con¬ 
sidered the properties of, and important results obtained by, linking 
together simple undamped, unforced circuits. 

The strong analogy between the basic mechanical mass/spring system 
and the inductance/capacitance electrical circuit is the basis of the 
analogue computer. 

The effect on the mechanical system of varying the mass and the 
spring stiffness can be visually represented by making an equivalent 
electrical circuit and varying the inductance and capacitance. The latter 
components can be varied very easily whereas it may be difficult to 
vary the equivalent components in the mechanical system. This analogy 
between mechanical and electrical circuits can be extended in scope to 
include complicated systems. For example the engineer confronted 
with an unwanted vibration (i.e. displacement and/or velocity) in a large 
and complicated mechanical system can frequently represent it by an 
equivalent complicated electrical circuit. Since the electrical circuit 
can be easily constructed in the laboratory the engineer will be able to 
assess what effect the varying of component values will have on the 
unwanted vibration and hence by analogy be able to assess the effect 
on the real mechanical system and hopefully be able to modify the 
mechanical system to remove the unwanted vibration. 

Returning now to the simple L — C circuit, suppose the circuit 
incorporated a switch which was held open with the condenser charged 
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to a value Q 0 . If the switch was closed at time t = 0, the initial con¬ 
ditions would be 

Q= Q 0 , / = dQldt = 0, at t = 0. 

Insertion of these conditions into the solutions for Q and / leads to 
Q 0 = A sin a 
0 = nA cos a. 


and hence a = 7r/2, A = Q 0 . 

The charge and current at time t after the switch closure are then 
Q = Q 0 cos nt, I = —nQ 0 sin nt. 

The charge on the condenser and current flowing in the network both 
oscillate sinusoidally with a natural (angular) frequency n(=l/yXO- 
In the absence of resistance there is no damping effect and the oscilla¬ 
tions are sustained for all time. 


L L 



Now suppose two such circuits are linked together as in Fig. 17.22. 
and let h and / 2 be two cyclic currents, then the governing equations 
are 


dly 0 ! 

L 7l + T = 0 ’ 


h~h = 


d_Q i 
dt ’ 


dQ, 

dt 


where Q x and Q 2 are the charges on the two condensers. 

Eliminating the currents A and / 2 and rearranging the equations 
leads to 

LC Qi + 2 Qi~Q 2 = 0 
LC Q 2 - Qi + Q 2 = 0. 
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To find the natural frequencies in this case it is assumed that Q 1 and 
Q 2 are oscillatory with frequency n[2ir, i.e. such that 

0i = A 1 sin («/ + a) 

0 2 = A 2 sin (nt + a) 

With this assumption the differential equations may be rewritten as a 
matrix equation 

U -!) C£)-*(i) 

where A = LCn 2 . 

It is noted once again that A is the eigenvalue of the coefficient matrix 
and that the natural frequency is given by \/(A/LC). The characteristic 
equation for A is 


i e. A 2 - 3A + 1 = 0. 


The two eigenvalues are thus 
3_ V5 
= 2 2 


A, 


■U4 


If three such circuits are linked together as in Fig. 17.23 the differential 
equations are 


LC + 201-02 =0 

LC 02-0! + 202 - 03 = 0 
LC & - 02 + 03 = 0. 


L L L 



Fig. 17.23 


and the natural frequencies of oscillation are given by nj2n where 
n 2 LC = A, and A are the eigenvalues of the matrix 
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The characteristic equation for X is P — 5it 2 + 6X — 1 = 0 and the 
three values of X are found to be real roots X x , X 2 , and X 3 such that 
0 < A, < 1; 1 < X 2 < 2; 2 < X 3 < 3. 

The numerical evaluation of these roots and hence the natural 
frequencies of oscillation will not be undertaken because it is better to 
investigate this particular circuitry by a different technique. Suffice to 
state that by an alternative approach it may be shown that the N 
eigenvalues X u X 2 , . . X N for N linked circuits are given by, X r = 
4 sin 2 (ja,I 2), where p, = (2r - l>r/(27V + 1), r = 1, 2, . . N. 

The natural (angular) frequencies of the oscillations are then given by 

n T = (2jVLC) sin (u/2). 

For example, if N = 2, the two frequencies are 

fh = (2/VlC) sin (77/10); n 2 = (2/VZc) sin (3 tt/ 10) 
and if N = 3, the frequencies are 

n x = (2 /vTC) sin (77/14); n 2 = (2/VZC) sin (3 t7/14); 

« 3 = (2 IVIC) sin (577/14). 

The reader may check the validity of these values by inserting the 
appropriate numerical values of X into the above cubic equation. 

In the general case of N circuits 

= (2/VLC) sin (77/2(2 N+ 1)); 
n 2 = (2/VLC) sin (377/2(2^+ 1)), . . ., 
n N = (2/VLC) sin {(2 N - l>r/2(2Ar + 1)}. 

It may be noted that as the number of linked circuits increases, i.e. 
as N -»■ 00 , the lowest and highest frequencies have limiting values, i.e. 

2 . 77 

Lim . sin », — 0 
Ar->® VjLC 2(2 N + 1) 

T . . ((2N - iy\ _2_ . 77 _2_ 

“ d VLC Sm ( 2(2JV + 1) ) = vTC Sm 2 ~ VLC 

The Fig. 17.24 shows the frequencies n r ,r= 1,2. . . N for different 
values of N. 

This system with N simple linked circuits has its natural frequencies 
within a frequency range 0 to 2/VLC. Thus if, for example, a combina¬ 
tion of currents at different frequencies are passed into such an elec¬ 
trical network it will readily accept the currents whose frequencies lie 
within this band but will attenuate those outside. Such a network can 
be used in electrical engineering as a low-pass filter circuit, i.e. a circuit 
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that will pass the low frequencies in the range 0 to 2/VZc but reject 
those higher than this range. 

If the capacitance C and inductance L are interchanged in the simple 
circuits then it is easily shown that the frequencies within the band 0 to 
2/Vlc are attenuated whilst those outside this band are readily 
accepted, and the network becomes a high-pass filter. 

By a judicious choice of components it is clearly possible to make the 
natural frequencies of a network lie within a specified band of values 
and so make an electrical device known as a band-pass filter which 
will readily accept only these frequencies within the specified band. 



The low-, high-, and band-pass filters have many applications in 
electrical engineering, e.g. radar, television reception circuits, telephony 
and telegraphy noise suppression circuits, record player scratch filters. 

It has been noted however that electrical filters as described have 
differential equations equivalent to those of some mechanical spring 
networks and it may also be stated that additionally the equations 
governing acoustical apparatus also have similar differential equations. 
Hence by analogy it should be possible to design mechanical spring 
networks or acoustical devices that permit acceptance or rejection of 
certain frequencies of oscillations. For example, a mechanical machine, 
may have its unwanted oscillations ‘filtered out’ by mounting the 
machine in flexible spring supports and similarly the unwanted acoustic 
oscillations of a car exhaust may be minimised by using a series of 
linked resonating cavities mounted along the exhaust pipe, i.e. a car 
silencer. 
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17.4 VECTORS 

Vectors make their appearance in an engineering context in a number 
of different fields but particularly when the engineering system is in 
motion in 2 or 3 dimensions. The vectors of force, acceleration, 
velocity, and displacement are easily understood and the various 
mathematical equations describing the desired physical motion are 
readily expressible in terms of these vector quantities. As an illustration, 
if a particle of mass m is travelling with a velocity v and is subjected to 
a number of forces F 1; F 2 . . . then the vector form of Newton’s second 
law stating that the total impressed force is proportional to the rate of 
change of momentum is expressible as 
d 

F = SF, = — mv) 
dt 

This differential equation could be, for example, the fundamental 
equation of a rocket of varying mass subjected to various gravitational 
and thrust forces. 

Vectors are also useful in describing a physical system in statical 
equilibrium under a system of forces. The framework type problem of 
17.3.2 could, for example, have been described mathematically as a 
number of vector force balance equations. 

In this section the application of vectors to the field of crystallo¬ 
graphy is considered and it will be seen how the earlier work of sections 
9.7 and 9.9 is important in the description of crystal geometry. 

This field is studied by many engineers but particularly by the 
metallurgists and chemical engineers. It is found that the vectors of 
displacement and direction are of importance. 

17.4.1 Crystallography and three dimensional geometry 

The essential characteristic of a crystalline material is the periodic 
nature of its internal structure. The atomic arrangement is related to a 
network of points in space known as a space lattice. The points of the 
lattice may be occupied by single atoms, but usually in a crystal they 
are points at which there are identical groups of atoms. In a perfect 
crystal the outlook from any lattice point is the same as from any other. 
Starting from an arbitrary lattice point O in Fig. 17.25 it is possible to 
draw three straight lines each of which joins with three other lattice 
points, distance a, b, and c away, and form a set of coordinate axes 
(not necessarily at right angles to each other). 

The distances from the starting lattice point 0 to the other lattice 
points may be different, i.e. a^b c. Instead of unit vectors along the 
three axes it is usual to form base vectors a, b, and c so that any point 
P of space is specified by its position vector r and r = xa + yb + zc. 
The point P is then designated by (x,y, z). The notation [x,y, z], with 
square brackets, is used in crystallography to indicate the direction of the 
vector r. The three axes are called the crystal axes. Naturally, it is 
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possible to choose the axes in a number of different ways, for example 
for certain crystals it may be possible that two or even three of the axes 
are at right-angles to each other. 

When the axes have been chosen they form a unit cell, i.e. a parallele¬ 
piped of sides a, b, and c and each crystal consists of a repetitive 
stacking of unit cells. The volume of this fundamental building block 
is important and is simply the volume of a parallelepiped of sides a, 
b, and c which is (see 9.9.1) given by V and V — a . b X c. 



Points of the crystal which coincide with cell corners are given by 
integral values of at, y, and z but points of the cell faces or inside the 
cell will have fractional values of x, y, and z. 

Any particular direction in the lattice is specified by the direction 
ratios [x:y:z] or [Xx:Xy:Xz] where X is chosen to avoid fractional 
coordinates. For example the line from the origin (0, 0, 0) to the point 
A of Fig. 17.25 at the middle of a cell face, i.e. to the point (£, 1) 

has direction ratio 1] or [1:1:2] or simply [1, 1, 2], 

The faces of crystals, and also other cross-sections within the crystal 
lattice, are planes and in crystallography it is necessary to have a 
system of orientation of such planes. The crystallographer uses what 
are called Miller Indices and arrives at a trio of numbers as follows: 

(i) Find the intercepts, on the axes, of the plane in terms of the base 
lengths a, b, and c. 

(ii) Write down the reciprocal of these numbers and form the smallest 
integers in the same ratio. 
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For example, if a plane has intercepts (4a, 3b, 2d) on the three axes then 
the three reciprocals are (£, }, f) and the Miller indices are (3, 4, 6). 
However, from Section 9.7 it is known that the orientation of a plane is 
given by its normal and hence the Miller indices must be related to the 
normal. In order to examine this relationship it is necessary to introduce 
a space lattice related to the crystal axes and known as the reciprocal 
lattice. The three base vectors forming the reciprocal lattice are desig¬ 
nated a*, b*, and c* and are given by 

a* = (b x c)/F; b* = (c x a)/F; c* = (a x b)/F 

where F = (a . b X c) is the volume of the unit cell. The axes a*, b*, 
and c* are respectively perpendicular to the planes of b and c; c and a, 
and a and b. Note that 

a* . a = b* . b = c* . c = 1 

and a* . b = a* . c = b* . a = . . . = c* . b = 0. 

Fig. 17.26 shows the space lattice a, b, and c together with the 
reciprocal lattice vector a*. 



Using the property of the scalar product, with 6 the angle between 
a and a* 


a* . a = a*a cos 6=1 
i.e. a* = l/acosd. 

This result shows that the length of the base vector a* is the reciprocal 
of the projection of a onto a*. 

Consider now any plane that has intercepts on the crystal axes 0 xyz 
of lengths a a, fib, and yc as in Fig. 17.27. The Miller indices are the 
smallest integers in the ratio [J, }, y]. 
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Fig. 17.27 

Then in vector terms 

OA = aa, OB = /5b, OC = yc 

The normal to the plane is a vector n perpendicular to AB and CB, i.e. 
to (/3b — aa) and (yc — /3b). Thus (see Section 9.6.2). 

n = (/3b — aa) X (yc — /5b) 

= /3y(b X c) — ya(a X c) + a/S(a X b) 
since b x b = 0. 

In terms of the reciprocal vectors 

n = /3ya*F + ayb*F+ a/3c*F 

= *Py v (i a * + p h * + ~ c *) 

It follows immediately that 

* 1 * 1 . 1 
n* = - a* + -x b* + ~ c* 

a p y 


is a normal to the plane. 
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But (1/a, 1 If), 1 /y) are proportional to the Miller indices which have 
immediate reference to the normal when the latter is expressed in terms 
of the reciprocal vectors. 

Another important concept of interest to the crystallographer is to 
know the condition that a certain direction specified by [p-q-r] 
is parallel to a given plane of Miller indices ( u, v, w). 

Using the previous results the direction [p-q-r] is specified by the 
vector 

r = pa + <jrb + rc 

whilst the plane has its normal in the direction of r* where 
r* = «a* + vb* + we* 

The condition that [p-q-r] is parallel to the plane is simply that r is 
perpendicular to the normal r*, i.e. tlat 

r.r* = 0 

i.e. (pa + qb + re) . («a* + rb* + wc*) = 0 

i.e. pu + qv + rw = 0 

17.5 COMPLEX NUMBER 

The representation of a complex number z as a vector in the Argand 
diagram and the Euler form z = r e w = r cos 0 4- ir sin 6 give z a 
geometrical interpretation and also show how a complex number may 
be related to the trigonometrical functions. These two representations 
are essential to the understanding of the way in which complex number 
analysis can be used in an engineering context. The simplest major 
application is in the field of alternating current theory which is concerned 
with the response of a network to applied alternating currents or 
voltages. The elements of this theory are considered in this section. 

17.5.1 Alternating current networks 

The basic alternating current which excites an electrical network is of 
the general form 

i = I 0 cos (cot + a) 

in which / 0 is the amplitude, co is the angular frequency and a is the 
phase angle. Since this current may be expressed in the form 

i = (/ 0 cos a) cos wt — (/„ sin a) sin wt 

it is sufficient to consider the response of the network to the basic 
current 

i = A cos a>t or i = B sin cot 
Now since (see Section 10.4) 

e ;cBt _ CQS _|_ j s j n wt 



APPLICATIONS 


501 


where y 2 = — 1, it follows that these basic currents may be obtained 
by taking either the real or the imaginary part of the complex current I 
in the form 

I = I 0 c lat 

Note The use of j 2 = — 1 instead of i 2 = —1 as in Chapter 10 follows 
Electrical Engineering practice where i is invariably used to represent 
a current. 

For example if I 0 = 3 and co = 2 then the real part of a complex 
current 3e 2/f is 3 cos 2t and the imaginary part is 3 sin It. 

Further, a more general type current with a phase angle can be 
simply represented as the real part of a complex current. For example, 
the current 

i = 2 cos (4 1 - tt/3) 

is simply the real part of the complex current / where 
I = 2e ,(4t-1T ' 3) 

i.e. / = (2e-'*' 3 )e'« 

which is of the form 

I = I 0 s ,ot 


with /„ = 2e~ i7rl3 and co = 4. 

Since the voltage response to a sinusoidal current is also sinusoidal it 
is to be expected that a complex current excitation of an electric 
network will lead to a complex voltage response (and vice versa). 

Consider now the voltage response of a simple network to a current 
of the basic form 

i = I 0 cos cot, I 0 real 

Suppose the current i (amps) flows in the network of Fig. 17.28 which 
consists of a resistance R (ohms), inductance L (henries), and capaci¬ 
tance C (farads). 



Fig. 17.28 
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The voltage response of the elements of this network are v R , v L , and 
v c respectively and, if q is the charge of the capacitance, 

v R — Ri — RI 0 cos cot 
v L = L — = —a)LI 0 sin cot 

Va = qlC=^jidt = ^smcot 

If the current i is expressed as the real part of a complex current I, i.e. 
i = Re I = Rel 0 e'“‘ 

then in like manner the voltages v R , v L , and v c are the real parts of 
complex voltages V R , V L , and V c obtained as follows: 

(i) v R = RI 0 cos wt = Re RI 0 e i<ot = ReV R 


00 


(iii) 


v L = —aoL I 0 sin cot = Re jcoL I 0 e /mt 
= Re V L 
V L = jcoL / 

v '-£5 sinM - Re {-dc , ° e '“‘) 

= Re V c 

Vr =^-I = —I 
c coC jcoC 


In each of the cases (i), (ii), and (iii), the response of the element to a 
complex current / is a complex voltage V of the form 


V = ZI 


where Z is a complex quantity called the impedance of the element of the 
circuit. 

For a resistor R, the complex impedance Z R is R, and is real, 

for an inductance L, the complex impedance Z L is jcoL and is 
imaginary, 

for a capacitance C, the complex impedance Z c is 1/jcoC and is 
imaginary. 

The significance of the representation of the current and voltage 
in complex form may be appreciated after reference to Section 10.1 
where the relationship between vectors and complex numbers was 
investigated and it was noted that a multiplication by j (or i of Chapter 
10) had the effect of a rotation through a right-angle in the positive 
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sense. Let the impedances Z R , Z L , and Z c be represented on the Argand 
diagram as in Fig. 17.29. 


imaginary axis 
! Z L =ju)\- 

I z R = R real axis 


f Z c = —j/aiC 

Fig. 17.29 

The impedance Z R is represented by a vector of length R along the real 
axis whilst the impedance Z L and Z c are represented by magnitudes coL 
and 1/coC respectively along the positive and negative imaginary axes. 

The voltages may also be represented on an Argand diagram, and 
Fig. 17.30 shows the complex current I = I 0 e iat together with the 
voltages V B , V L , and V c 



The Fig. 17.30 gives a geometrical (complex) interpretation of the 
fact that V R is in phase with the current I, that V L leads the voltage V R 
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by a right angle (i.e. there is one right angle difference in the appropriate 
sense between the phase angle of V R and V L ), and that V c lags the 
voltage V B by one right-angle. The voltages V L and V c are exactly tt 
(or 180°) out of phase. 

As time increases the angle cot increases and the voltage vectors of 
Fig. 17.30 may be considered as rotating around 0 with angular velocity 
to the relative phase angles being unchanged. The total voltage V 
across the network is the vector sum of V R , V L , and V c and this is 
represented on Fig. 17.30 where the angle <f> is the angle between V and 
I. In this simple network, if coL > 1/wC then the voltage V is in the 
direction indicated in Fig. 17.30 and, because this resultant complex 
voltage is on the same side as the inductive voltage V L , the network as a 
whole is said to show an overall impedance which is inductive and the 
voltage V then leads I by a plane angle <j>. Should toL < 1/coC then the 
voltage V would be on the same side as the capacitative voltage V c and 
the network would then exhibit an overall impedance that was capaci¬ 
tative and the voltage V would lag the current. 

The simple circuit of Fig. 17.28 may be replaced by an equivalent 
circuit as in Fig. 17.31 in which the quantities /, V, and Z are the 


— V - 
Fig. 17.31 


complex current, voltage, and impedance respectively. The complex 
impedance of the equivalent circuit is 

Z = Z B + Z L + Z c 


and 


— R + jwL H- 

ju)C 


Example (i) The actual current / in the network of Fig. 17.28 is 
i = 3 cos At, find the overall voltage v and also v c . 




Solution In complex form 


APPLICATIONS 


505 


/ = 3e« ( 

and r = 30“ (*+,'(„£-;!)} 

Hence v = ReV = Re 3(cos At + j sin At) jf? + j ^coL — 
and v = 3R cos At — 3 {o>L — — j sin At 
The complex voltage across C is V c and 



and hence v 0 — Re V c 



Example (ii) The actual current i in the network of Fig. 17.28 is 
i = 2 sin 3 1. Find the voltage v in the case R = 2,L=\,C= 1. 

Solution The current is 

i = 2 sin 3t = 2 cos (3 1 — 7t/ 2) 
the complex current is 

/=2e' <3t -* /2) 

the complex voltage V then has the form 

F= 2e~ 2 )^ +7 -( 3£ __Ljj 

= 2(-y)e«‘{2+;(7/6)}. 

The actual voltage v is then 

v = Re V 

= 4 sin 3 1 + (7/3) cos 3 1 

Note It is assumed in the above process that the circuit is excited by a 
current i which, although time-dependent, is in its steady state. That is, 
any transient effects due to switch-on phenomenon have died away. In 
a mathematical sense this means that the circuit switch was closed at 
time t = — oo. 

Consider now a more complicated electrical network as in Fig. 
17.32 which shows the actual circuit (a) and the equivalent complex 
form (b). When a sinusoidal steady-state voltage of angular frequency co 
is applied across the terminals AB, each of the elements of the circuit 
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Cl c 2 



(a) (b) (c) 


Fig. 17.32 

has an impedance dependent on that frequency. The complex voltage 
V(= V 0 e y “‘) is applied across AB and the complex impedances are 
given by 

z ‘ = si' z ’ = ^- z > = *' Z ‘=> L - 

The impedances may be combined together by the usual series and 
parallel rules the network then being equivalent to a single impedance 
Z, as in Fig. 17.32(c), such that 

z -- l + ri+_i_r 

coCj. ^ IR ^ jcoL -jjcoC 2 J 
-j . VR + jXcoL-llwCJ ] - 1 
coC, + L Rj(<»L - 1 /ojC 2 ) J 

7- ^ ~ J ‘ l ~ 

1C ' coQ + i? + y(o*L - l/ft)C 2 ) 

The complex current I is then given by 
/= VjZ 

and the actual current i flowing through the network would then be 
obtained by taking the real part of VjZ. 

When dealing with impedances in parallel it is sometimes more 
expedient to work in terms of the reciprocal 1/Z. This quantity is called 
the admittance Y and it follows that two impedances Z 1 + Z 2 in 
parallel is equivalent to two admittances Fi and Y 2 in series. 

Resonance of a network 

Consider the simple R, L, C, series circuit of Fig. 17.28 excited by an 
applied sinusoidal voltage of angular frequency w. Suppose the complex 
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voltage is V then the complex current / flowing in the network is given 


where Z = R + J(ojL - 1/mC) 

The magnitude of the current, |/|, for given values of R, L, and C is 
dependent on a> and reaches a maximum value when |Z|, i.e. 

VR 2 + (mL - l/<wC) 2 

is a minimum. This minimum value of \Z\, for varying to, is given when 
co = co 0 such that 

WqC 

i-e. Wq = 1/VZc. 

The impedance Z is then purely resistive with the value R. 

The graph of |/| plotted against <w has the shape of Fig. 17.33 and is 
known as a resonance curve, the frequency <u 0 being called the resonance 
frequency. 



Note that at this resonant frequency the impedance of the circuit is 
purely resistive. This fact provides a simple criterion for resonance of 
any circuit, namely that at resonance the imaginary part of the complex 
impedance Z of the circuit is zero. For circuits with elements in parallel 
it may be more convenient to work in terms of admittance. Now 
suppose 

Y = 1/Z = A +jB 
2 _ A . B 
A 2 + B 2 J A 2 + B 2 


then 




508 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 
and the condition for resonance is 

ImZ = 0 

i.e. B = 0 

which is the same as ImY = 0. Hence for resonance the imaginary 
part of either the circuit impedance or admittance is zero. 

Example Find the resonance frequency for the circuit of Fig. 17.34. 


Solution The admittance Y is given by 

r -'l z -RTPL + i aC 

R — jo)L . 

= Ri+coW +J0)C 
The resonant frequency co 0 is given by 
Im F(oj 0 ) = 0 

i.e. co 0 Ll(R 2 + (OqL 2 ) = co 0 C 

i.e. co 0 = Vl/LC - (. RIL) 2 . 

Note that this frequency differs from that of the simple R, L, C series 
circuit but that it approximates to 1 j^/LC if R -> 0. 

Resonant circuits play an important role in the electric circuitry of 
radio, television, measuring and filtering equipment. Consider for 
example the network of Fig. 17.22 of Section 17.3.3. The impedance of 
such a network when an input voltage of frequency co is applied, is 
given by 

Z = ja>L + X, 

where — = Fi = —^ + r—p — ^ 

Zi —jjmC jcoL—j/coC 

= JmC ~ (wL -1/ftic) 




and hence Z becomes 
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Z = jcoL - j[(oC - 1 l{wL - l/wC)]' 1 
For resonance it is required that ImZ = 0, i.e. that 


ojC - ll(a>L- 1/coC) 

and at the frequency given by this equation the circuit presents a (zero) 
resistive impedance. 

If co 2 LC = X, then the resonance condition is 
X- 1 

i - i/a -1) 

i.e. X 2 - 3X + 1 = 0. 

This gives the two natural frequency of resonance as already obtained 
in the earlier work on filter theory of Section 17.3.3. (see page 493). 

The A.C. Bridge 

Consider a network of impedances as in Fig. 17.35 with complex 
cyclic currents I u I 2 , and 7 3 due to a complex voltage V all at frequency 
co. The network equations are given by 

AZi + (A - / 2 )Z 5 + (A - / 3 )Z 4 = 0 
(/ 2 - I X )Z 5 + I 2 Z 2 + (/ 2 - / 3 )Z 3 = 0 
(Is - 70Z 4 + (I 3 - I 2 )Zs = V. 



This type of A.C. bridge is adjusted by varying the impedances so 
that the current flow in the indicator Z 5 (e.g. ear-phones) is zero. This 
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requirement is I x — 7 2 = 0, with 7 4 , 7 2 , and 7 3 non-zero. With zero 
current in Z 5 the first pair of the above three equations reduce to 


l x {Z x + Z 4 ) - 7 3 Z 4 = 0 
/ 4 (Z 2 + Z 3 ) - / 3 Z 3 = 0 

from which it follows that 7j, 7 3 are non-zero providing 

|Z 1 + Z 4 z 4 | 

I Z 2 + Z 3 Z 3 I 
i.e. Z 4 Z 3 = Z 2 Z 4 

or Z 4 /Z 4 = Z 2 /Z 3 . 


Thus the requirement that there is no current flow in Z 5 , i.e. that the 
bridge is balanced, is that the impedance in the four arms of the bridge 
are in the above ratios at the frequency w. Since the impedances are, 
generally complex quantities then by taking real and imaginary parts 
of the condition, two relationships may be obtained. 

The types of A.C. bridges and the uses to which they may be put 
are many and varied. As an example consider the particular bridge of 
Fig. 17.36 known as the Wien Bridge in which the resistance R x and 
capacitance C 1 are unknown and A: is a real positive number. 

Relating Figs. 17.35 and 17.36 then 

Z 4 = 7?i + 1 IJcaCu 
Z 2 = R 0 , Z 3 = kR 0 , 

Z = _I_, 

4 1/7? + j(oC 

and the bridge is balanced when 

(* x + jfak) kR ° = Ro (l/7?+>c) 

i.e. k [(t? 4 - ^ (1/7? +yo,C)] = 1 = 1 +J0. 

Equating real and imaginary parts gives two equations 
k(RjR + C/Ci) = 1 

mCR 1 = 1/twCjT? 
and, solving for C x and 7? 4 gives 

C ± = kC(\ + l/o) 2 C 2 7? 2 ) 

7?! = 7?/7c(l + <m 2 C 2 7? 2 ). 
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The values of Q and can therefore be found at frequency o>. Alterna¬ 
tively if C\ and R x are known it is possible to use this device to measure 
the frequency m of an oscillatory voltage. 

Suppose that Ci = C and R t — R (this is easily achieved by a ganging 
arrangement) then when the bridge is balanced the above equations 
lead to 

k = \ and oj = l/CR 

Thus if k is given the value of 1/2 and the resistances R lt R and capaci¬ 
tances Ci, C are coupled together so that they have identical known 
values the frequency of the oscillation is obtained by calculating 1 IRC 
when the bridge is balanced. 



17.6 DIFFERENTIAL EQUATIONS 

Engineering is frequently concerned with the rates of change of certain 
variables and the physical problem is often expressed simply as an 
equation of balance, e.g. a balance of mass, or heat, or energy. The 
balance equation itself may in the physical sense be a simple statement 
such as 

Input — Output = Accumulation 
over a specified time interval, or even an empirical law such as 
Force = Mass x Acceleration. 

When the statement is expressed using mathematical symbols it often 
represents a differential equation (derivatives being involved because of 
the involvement with rates of change). In this way a simple physical 
statement may lead to a (sometimes complicated) differential equation. 
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If this equation can be solved, either analytically or numerically, then 
the engineer must interpret the mathematical solution in.terms of the 
physical variables. Consider now two engineering type problems which 
lead to differential equations. 

17.6.1 The oil purifier 

The oil used in lubricating a ship’s engine becomes dirty after a 
certain time of running. It is not always possible, or necessary, to 
change the oil and a purifier may be installed within the ship. This is 
usually of the centrifuge type in which the dirt particles and other 
impurities are removed by centrifugal action. 

The mechanical process consists of passing the impure oil from the 
engine into a drain or settling tank where some of the larger impurities 
can be removed by gravity. The oil from the tank is then fed to the 
centrifuge where some of the impurities are removed and the part- 
purified oil is returned to the drain tank. The oil is returned from the 
tank via an output pipe to the engine. The process is illustrated in 
Fig. 17.37(a). 


part-purified from engine 



Fig. 17.37(a) 


kmBt 






cm8t 

DRAIN TANK 

c 


p8t 





Fig. 17.37(b) 
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The physical situation is analysed mathematically by forming a 
balance equation on the mass of impurities in the system. The Fig. 
17.37(b) shows the input and output of impurities in the drain tank. Let 
p be the net mass of impurities entering the tank per second; let c be the 
concentration of mass impurities in the tank at time t\ let k be the 
concentration of mass impurities returned to the tank from the centri¬ 
fuge; let m be the mass of oil entering the centrifuge per second; let 
M be the total mass of oil. The physical mass balance equation for the 
impurities in the drain tank (see Fig. 17.37(b)) is 

mass input — mass output = mass accumulation. 

If the concentration of impurities in the tank changes from do c + 6c 
in the time interval t to t + 6t then the mathematical statement of the 
balance equation is 

{pdt + k(mdt )} — {cmdt} = {M(c + 6c) — Me}. 

Dividing by 6t and proceeding to the limit as 61 -* 0, 
dc 

M -jj = p + km — cm 


p + km 
M 


which is a differential equation of the form of equation (15) of Section 
11.1.5, i.e. dyldx + ay = Q(x) with y = c; x = t; a = mjM, 
Q{x) =(p +km)lM. The solution is given by equation (18) of 11.1.5, 


c = A e mtlM + (p + km)lm 

where A is the arbrtrafy~£Dnstant of integration. Assuming that the 
initial concentration at / = 0 was c = c 0 then 

c 0 = A + (p + km)jm 

and hence c = c 0 e' mtlM + {k + pjm){ 1 — e _ mtlM ). 

The graph of c (the concentration of impurities in the oil) as a function 
of t is illustrated in Fig. 17.38. 

The ultimate (i.e. / -> co) concentration is given by (k + pjm) and 
even with no input from the engine (p = 0) it is impossible to reduce the 
concentration below the value k. Any further reduction in concentration 
can only be achieved by reducing k, i.e. designing a more efficient puri¬ 
fier. Writing 

(c — k — pjm) = (c 0 — k — pjm) e" mtlM 
the concentration will fall half-way towards its ultimate value when 
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i.e. the half-way concentration is reached after a time t = ( M/m ) log 2. 
This half-life time can be reduced only by increasing m (for a given 
total mass M) and this again modifies the design of the purifier. This 
design is optimised by taking all possible factors of the centrifuge into 
account, e.g. the speed, the maximisation of the input, the maximisation 
of the purification and the physical size limitation. 

17.6.2 Chemical reactions 

Reactions between chemical materials to produce a yield is a process 
which is of great importance in the chemical industry. The types of 
reactions are many and varied. Consider first only the simplest kind of 
reaction in which a chemical material A transforms into another 
material B possibly by reacting with a catalyst (which itself undergoes 
no change). There is assumed to be no change in volume of the system. 

If the concentrations of materials A and B at time t are c A and c B 
respectively then in a time interval dt the mass balance equation (i.e. 
mass flow in — mass flow out = mass accumulation) is given by (see 
Fig. 17.39) 

0 — kc A dt = (c A + dc A ) — c A , for material A 
and kc A dt — 0 = (c B + Sc B ) — c B , for material B. 

where A: is a constant called the reaction constant or rate coefficient. 

Dividing by dt and proceeding to the limit leads to a pair of differential 
equations, 







APPLICATIONS 


If the initial concentration of A is a then the first equation is easily 
integrated to give c A = a e~ kt . If further the initial concentration of B 
is zero the second differential equation leads to c B = a(l — 


f - c a t, c 8 



t + St, c A + Sc A t + St, c e + Sc B 


Fig. 17.39 

Note that this simple type of reaction proceeds until eventually 
(t -* go), c a reduces to zero and c B is a, i.e. all of A is converted into B. 

A more complicated situation arises in which B may react and 
convert back into A but at a different rate. This situation is illustrated 
in Fig. 17.40 in which the forward rate k x is different to the backward 
rate k 2 . 



Fig. 17.40 

The equations of reaction are now found to be 
= k 2 c B — k x c A 

a dc B 

and -Jf = he a - k 2 c B . 

These differential equations can be solved by noting that 
dcjdt + dc B jdt = 0. 

Thus, c A + c B = constant 

= a + b 

where a and b are the initial concentrations of c A and c B respectively. 
Substituting into the first differential equation leads to 
dc. 
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which has the solution (see Section 11.1.5) 

c, = + k 2 (a + *)/(*, + k 2 ) 

where E is the arbitrary constant. The initial condition on c A , i.e. 
c A = a, t = 0, gives 

E= a- k.fa + b)/(k x + k 2 ) 
and hence the complete solution is 

(a + b) , f (a + bY | 

‘--TT* + [ 0 ~TT*J e ‘ ’ 

c B = a + b — c A , 
where K = kfk 2 . 

The limiting (i.e. / —*■ oo) concentrations of c A and c B are then 
(a + b)l(\ + K) and K(a + b)j(\ + K) respectively. 

Note that in the particular case b = 0 and k ± k 2 (i.e. k 2 -*■ 0, 
K -> oo) the concentrations become 

c A = a e- fc i ( 

and c B = a(l — e _,Clt ) 

which are the results already found for the simple forward reaction. 

In the simple forward reaction the process progresses by converting 
a molecule of substance A into B and the reaction is called one of the 
first order. A more general reaction would be one such that two 
reacting substances A and B (say) are brought together and one mole¬ 
cule of A reacts with one molecule of B to form some resultant product 
(or products), i.e. 

A + B -► Products. 

Generally the reaction rate is dependent on the concentrations of both 
A and B and a generalised form of the governing differential equation 
for the rate of change of A is 



where a, and fi are the reaction orders with respect to A and B respec¬ 
tively and a + /? = n is the overall reaction rate, and k n is the rate 
constant. In a general reaction a and [i need not be integers, but 
usually n < 3. The reaction orders (i.e. the values of a and /?) are 
usually deduced from experimental observations. A particularly useful 
law followed by a number of reactions is that for which a = /S = 1, i.e. 
n = 2 and the reaction is called second order. The governing equations 
for this reaction are 

dc A dc B 

-df=~df == ~ k * C * CB - 
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This differential equation may be integrated on noting that 

i.e. c A — c B = constant = a — b = 1 (say) 

where a and b are the initial concentrations of A and B respectively. 
The differential equation for c A then has the form 

dc a 

-jf = ~k 2 c A (c A - X) 

which is of the type of Section 11.1.2, i.e. variables separable. Hence 
c ^ j dc A = —Xk 2 t + constant 

which leads to 

Q _ 

—-= De~ Xk A 

c A 

where D is an arbitrary constant of integration. 

Inserting the initial condition that c A = a, t = 0 leads to the evalua¬ 
tion of D and hence to 

a-b 

Ca l-(6/ fl )e- fc .<-«* - 

More complicated reactions may take place and possibly unwanted 
effects may occur, for instance although it may be required to convert 
material A into material B it might happen that a further material C 
is produced by a forward reaction from B as in Fig. 17.41(a) or by a 
side reaction from A as in Fig. 17.41(b). 


Fig. 17.41 

Consider as a final example the case of a reversible reaction A to 
B taking place and an unwanted irreversible side reaction to X also 


LZHZHZI 
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taking place as in Fig. 17.42 where k 1 and k 3 are the reaction constants 
A to B and B to A respectively and k 2 is the reaction constant from A to 



Fig. 17.42 


The governing equations are 
dc A 

— = -k 3 c A + k 3 c B - k 2 c A 
— k\C A — k 3 c B 


The first pair of equations do not contain c x and hence can be solved 
for c A and c B . Write D = djdx, then 

(jD + ki + k 2 )c A — k 3 c B — 0 
— k\C A + (D + k 3 )c B = 0. 

Operating on the first equation with ( D + k 3 ) and multiplying the 
second by k 3 and adding the resulting equations leads to 

{{D + k 3 )(D + k x + k 2 ) - k x k 3 )c A = 0 
or ( D 2 + KD + L)c a = 0 


where K= k x + k 2 + k 3 and L = k 2 k 3 . 

This second order differential equation has the solution (see Section 
11.4) 

c A = Pe-«<+ Qt-» 

where P, Q are arbitrary constants and a, /? are given by 
„ K± VK 2 - 4L 

-2- 
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and are both real and positive. The concentration c B may now be 
found from the first of the governing equations, i.e. 

, d 

a 3 c b — c A + k x c A + k 2 c A . 

= Pfo +k 2 - a) e“ ” f + 0^ + £ 2 - 
and finally c x from the last of the governing equations 



where R is a further integrational constant. The 3 arbitrary constants 
P, Q, and R may be determined from the initial conditions of con¬ 
centrations, i.e. if a is the initial concentration of c A and the initial 
concentrations c B and c x are both zero then 

a = P + Q 

0 = P(k x + k 2 ~ «) + Q(k y + k 2 -p) 

0 = R- k 2 P/x - k 3 Q/p. 

giving three equations for the three constants P, Q, and R. 

17.7 PARTIAL DIFFERENTATION 

Many physical quantities are functions of several (i.e. 2 or more) 
independent variables, e.g. the volume V of a circular cylinder is a 
function of its height h and radius of cross section r and V = irr 2 h-, the 
pressure/? of a perfect gas is a function of temperature T and volume v 
and p = RT/v where R is a constant. 

Engineering situations are frequently concerned with the change of 
one physical quantity with respect to variations in the other variables. 
These rates of change lead to partial derivatives, e.g. if p = RTjv then 
the rate of change of pressure at constant volume v is dpjbT while the 
rate of change of pressure at constant temperature T is dp/dv. Likewise 
it is possible to have other derivatives such as 8T/dp and dvjdT. Con¬ 
sider now two fields of engineering study that demand the use of partial 
differentiation. 

17.7.1 Elasticity. The displacement 

Suppose a straight bar of elastic material is lying along the x-axis from 
x = 0 to x = L. Let the end x = 0 be fixed and the other end be subject 
to a longitudinal force that causes the bar to extend. The Fig. 17.43(a) 
shows the static equilibrium state and Fig. 17.43(b) shows the situation 
after stretching. Because this motion takes place in one-dimension, i.e. 
that of the x coordinate, it is to be expected that any rates of change 
in this case will not lead to a partial derivative. 
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Fig. 17.43 


The point P moves to a new position P' and is said to have a displace¬ 
ment of u. The point Q (originally atx + dx) takes up its position at Q' 
(i e. the point x + u + dx + du), and the element PQ (originally fix) 
has changed to P'0' (now of length dx + du). Different points P along 
the bar will have different displacements, i.e. u is a function of x, 


u = fix). 
Then 


= fix + dx) -fix) 

-Udx + lMll 


after expanding f{x + dx) by Taylor’s theorem. 

In the simple theory of elasticity it is assumed that the change in u, 
i e du, is small enough to be represented by the first term in the series 



This function dufdx is called the rate of displacement or the displacement 
gradient and has an important role in Elasticity theory. Interest is 
centred on the change in length of the small element PQ and as a 
measure of this change it is usual to compare the change in length 
IP'Q' - PQ) with the original length PQ and to examine this ratio 
as the length element PQ tends to zero. This quantity is called the 
strain at the point P and takes the symbol e, i.e. 


P'Q'-PQ 
6 — ^ m „ PQ 

. ldujdx)dx + . 
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The ideas inherent in this measurement of strain can be extended to 
two-dimensional elasticity. Suppose a sheet of elastic material occupies 
a part of the usual two-dimensional space ofxandy and is distorted by 
a tension. The elementary length dx in the x-direction undergoes a 
change but so also does an elementary length 6y in the ;-direction. 
The Fig. 17.44 shows a small rectangular element PQRS of the elastic 
medium such that P is the point (x, y) and the sides of the element are 
dx and 6y. After a tension is applied to the medium the point P moves 
to F (x + u, y + v) and the other points take up new positions as 
indicated in Fig. 17.44. 



Fig. 17.44 


The distortion consists of 3 distinct parts, firstly a translation as a 
rectangle so that P takes up the position P', secondly an elongation 
of the rectangle to the shape P'Q"R"S" and thirdly a rotation of the 
sides to take up new position P'Q’R'S. The first part, i.e. the trans¬ 
lational movement may be ignored because the relative position of 
points are unchanged. The third part of the distortion is represented by 
the angles a and £ of Fig. 17.44. 

The quantities u, v are the components of the displacement PP' and 
are functions of position, i.e. u = u(x, y) and v = v(x,y). 

The coordinates of the points relative to P are as follows (assuming 
a and p are small): b 

Q : (dx, 0) 

S: (0, 8y) 

P': (u, v), i.e. (u(x, y), v(x,y)) 

Q': (dx + m(x + dx, y), v(x + dx, y)) 
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Q" : (bx + u(x + dx,y), v(x,y)) 

S': ( u(x,y + by), by + v{x,y + by)) 

S": (u(x,y ), by + v(x,y + <5y)). 

The component of strain in the x-direction is designated e n and is the 
unit change in length of the elementary length bx, i.e. 

P'Q’ - PQ 
fin — Lim p ~ 
pq -+o 

P’Q" - PQ 


P'Q" = bx + u(x + bx,y) - u(x,y) 

. Sm 8 2 u (bx) 2 

= dx + rx dx+ b?^r + - • • 

using the Taylor expansion for a function u with one variable y held 
fixed. 

Thus 


£ n 


■ Lim 2^+.. 


= du/dx. 

In a similar manner the strain component in the y-direction is 
designated £22 and is given by 

„ . P'S' - PS 
£22 — Lim no 
ps—o 

. P'S"-PS 
= Lim- 

6y—0 

„. ( dv/dy)by , 

= Lim---r . • • 

^-0 by 

= dv/dy. 

The extent of the shearing action which has distorted P'Q"R"S" to 
P'Q'R'S' may be examined by forming the difference between the 
right-angle QPS and the new angle Q'P'S', i.e. in Fig. 17.44 by the 
angle (a + ft. ^ 

Now a — tan a — Lim p , n „ 

6x-*o x y 

but Q'Q" — v(x + bx, y) - v(x, y) 

dv b 2 v (bxf 

~ bx 8x+ bx 2 2! + ' • ' 


and P'Q" -& + du/dx bx, 
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Hence, assuming that dujdx <€. 1, then 


= dv/dx. 

Similarly 

/? — dujdy 

and the shearing effect may be measured by 

„ , o 8v du 
* + f * = Tx + Fy 

This quantity is designated y 12 and is usually called the engineering 
shear strain. 


Note ( 1 ) The suffixes 1, 2 refer to the x- and ^-directions respectively. 
Some texts designate the strain components as e„, e yy , and 

Vxv 

Note (2) It may be observed that y 21 can be defined by interchanging 
the roles of x,y and of u, v in y 12 , i.e. y 21 = du/dy -(- dv/dx and 
it follows that y 12 = y 21 . 

These ideas may be extended to a three-dimensional elastic solid in 
which a point (x, y, z) is displaced to a new point (x + u, y + v, z + w). 
The strain components are then given by 

e n = du/dx y 12 = du/dy + dv/dx = y 21 

e 22 = dv/dy y 13 = du/dz + dw/dx = y 31 

£33 = dw/dz y 23 = dv/dz + dw/dy = y 32 . 

From a mathematical viewpoint it is more convenient to use the alter¬ 
native notation (x ls x 2) x 3 ) for (x,y, z) and (u u u 2 , u 3 ) for (u, v, w). In 
this notation the suffixes (1, 2, 3) refer to the three directions of x v 
and z. ’ 

Then 


£ n = du 1 /dx 1 y 12 = duJd Xl + du 2 /dx 2 = y 21 

e 22 = du 2 /dx 2 y 13 = d Ul /dx 3 + du 3 /dx x = y 31 

£33 = du 3 /dx 3 y 23 = du 2 /dx 3 + du 3 /dx 2 = y 32 . 

If further the mathematical shear strain is introduced by writing 

£12 = iyi2 = i( pu 1 jdx 2 + 8 u 2 /8 x 1 ) 

£13 = iy X3 

£23 = iy 2 3 
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then the six (mathematical) strain components may be written as a 
strain matrix e w , i,j = 1, 2, 3, i.e. 

/fill £12 £ 13\ 

\ fi 31 £32 £33/ 

in which s tJ — ^{dujdxj + dUj/dXi). 

This particular matrix array (e tJ ) has further properties which are of 
importance in the mathematical theory of elasticity. In the form (e tj ) 
the above array is known as the strain tensor. 

17.7.2 Fluid dynamics 
A. The material derivative 

Suppose a fluid (e.g. gas or liquid) particle at time t 0 is at a point P 0 
whose position vector is R with reference to a fixed point O. At time t 
suppose the same fluid particle is at the point P whose position vector 
is r. The Fig. 17.45 shows the path of the particle from P 0 to P. Note 
that, for fixed R, the vector r is a function of t only and r = F (t). 



The velocity V of the particle at P is given by 


Y 


In Cartesian coordinates with O as origin, r = xe 1 + ye 2 + ze 3 and 
the three components of the velocity (i.e. u, v, and w) are u = dxjdf, 
v = dyjdt, w = dzjdt. The particle at point P of the fluid has associated 
with it other vector functions such as acceleration and force and certain 
scalar functions such as pressure, density, and temperature. Consider 
first one of these scalar functions, say pressure p. 
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The pressure p at any point P may be considered as a function of 
both its position and of time, e.g. the pressure varies over a region at 
fixed time and also varies over a period of time at a fixed point Hence 
p may be written/? = f(x,y, z, t). However, since r is a function of t only 
it follows that x, y, z, are functions of t only and the above relation¬ 
ship may be re-written 

P =f[x(t), y(t), z(t), t ] 

= m 

where </> is the function / when expressed in terms of t only. 

It is possible to differentiate p partially with respect to x, y, z and t 
i.e. dfjdx, df/8y, df/dz, and d/Jdt and it is also possible to differentiate 
p as a function of t only, i.e. d<f>fdt. 

An application of equation (27) of Section 5.6.2 in which n = 4, and 
x u x 2 ,x 3 , x 4 are replaced by x, y, z, t leads to 

fy^dfdx ofdy cfdz dfdt 
dt 8x dt + dydt + dz dt + dt dt 
or in terms of the pressure p 

?P = fy, ty , ty, ty 

dt dt ^ dx + V 8y + w 8z 
where u = dx/dt, etc. 

It is worth stressing that although the same symbol p appears on 
both sides of this last result the function represented by p is different. 
It is very important to be able to distinguish between dpldt which 
means 



and dpldt which means (d/dt)<f>(t). The term cpjdt on the right side of 
the above result represents the rate of change of pressure with respect 
to time when P is considered fixed, the other terms of the right side 
represent the rate of change of pressure at fixed time t as P moves with 
the fluid. The left side is then the total rate of change of pressure. This 
type of differentiation is often called “differentiation following the 
motion of the fluid” or simply “the material derivative ”. 

In exactly similar fashion any of the other scalar or vector functions 
which depend on position and time may be analysed. For example the 
density p may be differentiated to give 


dp_8p 
dt dt + 


f + v e / + w^ 

ox dy oz 


and the velocity V may be differentiated to give 


dy _ dV 8V dV 8V 
dt - dt + U lTt + v ~dt + W ~di' 
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This latter result being an expression for the rate of change of velocity 
is simply the acceleration of the fluid particle at the point P at time 
and naturally plays an important role in the derivation of the equation 
of motion of a fluid. Although the equation of motion will not be 
derived here it is possible to state that an application of Newton s 
Second Law, that applied force is proportional to rate of change of 
momentum, can lead to an equation of the type 



where R represents all the forces acting on a unit of mass of the fluid 
at P (e.g. body forces due to gravity or spin, pressure forces, viscous 
forces, electromagnetic forces). 

Finally, note that dp/dx, dp/dy, dp/dz are the 3 components of grad p 
and that, since (w, v, w) are the three components of V, 



may be written as a scalar product 

V. grad p 

or V.Vp. 

The operational form of the material derivative plays an important 
role in fluid dynamics and has the form 

rf_3, — 4- I, — 4- w —- 

dt 8t +U dx^ dy dz 

or = | + ( v • grad) 

or =i + < V -' 7 >- 

B. The Continuity equation 

An important concept in fluid dynamics is that, although the material 
is moving and its volume and density are changing, the mass of any 
given volume is conserved, i.e. that mass is not annihilated or created. 

It is possible to arrive at the mathematical equation of conservation 
by considering any arbitrary shaped volume in the fluid but the 
mathematical processes involved are outside the scope of this book. 
However the same equation can be obtained for a particular shaped 
volume of fluid, namely a rectangular parallelepiped. The physical 
property of mass conservation is simply a balance equation over the 
rectangular parallelepiped that 

mass flow in - mass flow out = mass accumulation 
and the mathematical statement is a partial differential equation. 
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Suppose P is the point (x, y, z) of Fig. 17.46 and the sides of the 
rectangular volume are parallel to the x, y, and z axes and of lengths 
dx, dy, and dz respectively, e.g. W is the point (x + dx, y + 8y, 
z + dz). At P the fluid velocity V(x,y, z, t) has components (u, v, w) and 



the density isp(x,_y, z, t). Consider the flow in they-direction across the 
face PQRS. The only contribution to this flow into the region is due to 
the velocity component v in the direction y. The other two velocity 
components u, w being parallel to the face PQRS do not contribute 
anything to the flow into the parallelepiped. To the first order of small 
quantities it may be assumed that the velocity component v is constant 
over the face and hence the mass flow into the region in time dt is 
(density) x (area) x (velocity) x (time), i.e. 


( P v )atx.y.z 8x8z8t 

The flow out of the region over the parallel face UVWTdXy + dy is 

(pv)at x.y+6yj dx dz dt 

The function (pv) x _ y+6y _ 2 has a Taylor expansion (using equation (18) of 
Section 5.4 with h = 0, k = dy and inserting a third variable z), i.e. 
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Retaining only the first two terms of the expansion the difference (mass 
flow into region over PQRS — mass flow out of region over UVWT) is 
given by 

— — (pv) dx dy dz dt 

Calculating the similar results for the other four faces leads to a total 
contribution of (mass flow in — mass flow out) given by 

- (p«) + (pv) + ^ (pw)J dx dy bz dt 

During the time interval t to t + dt the density within the volume 
changes from p to p + dp and the mass accumulation is (Density 
increase) X (Volume), i.e. 

dp dx dy dz 

(Note this assumes that the density over the region is uniform which 
is strictly not true, but the error is small, if dx, dy, dz are small, and 
decreases to zero as the volume shrinks to zero.) 

Forming the balance equation leads to 

dp dx by bz = - ^ (pu) + ^(pv) + dx dy dz dt. 


Dividing by the volume element (dx dy dz) and by dt and proceeding 
to the limit so that dx, dy, dz, dt ->-Q and remembering thatp is function 
of ( x,y, z, t) leads to 


dp 

dt 


which is the equation of continuity or the mass conservation equation. 
An alternative form may be obtained on differentiating the product 
terms and using the material derivative, i.e. 


, d P_ , ?P 

dt + dx + dy 


dp / du dv dw\ 
W dz+ P \0x dy + dz) 


= 0 


dp (du 

I+pyirx* 


dy dz) 


This differential equation together with the equation of fluid motion 
considered earlier in this Section form two of the most important 
equations of fluid dynamics. 

Finally, it may be noted that in the simple case of liquid flow (e.g. 
hydraulic motion) the density is constant and the continuity equation 
reduces to 


du dv dw 
dx dy dz 


= 0 . 
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EXERCISE 1 (page 17) 

1 44; 0; x + l/* 2 ; (jc + h) 2 + l/(x + h). 

2 1; 1; -1. 

3 (.i)x={l+y)l(l-y),x¥‘-l,y¥‘l 

00 X = y 2 l(y 2 - 1), X < 0 or jc > 1 ,y ^ ±\ ; 

(iii) x = i[->> ± V3(4 - /)], -2<y <2, -2<x<2; 

(iv) x = ±V[yl(y — 1)], x # ±l,y < 0 or y > 1. 

4 (i) continuous; (ii) not continuous; (iii) not continuous; 

(iv) not continuous. 

8 (i) -3 < x < 3; (ii) r<-3orr> 3; (iii) -2 < * < 6 
(iv) 0 < x < 2. 

9 V3/2; -v'3/3;2 V '3/3; -2. 

10 a/3/2; -V3; -2; 2^3/3. 

11 (0 •; (ii) (iii) *»-**- 

12 (i) straight line parallel to the z- axis; 

(ii) straight line 4 units above and parallel to the line v = * in the xv- 

plane; J 

(iii) circle in the plane z = - 3 centre (0,0, -3) radius 3; 

(iv) ellipse centre (0,0,0) in the xz-plane, semi-axes a, b 

13 (i) circle centre in Cartesian coordinates (0,0,2), radius 3, in the plane 

(ii) straight line from the origin in the plane <f = making an angle 
45° with the z-axis. 6 & 

14 (i) Circle radius 5, centre the origin lying in plane <p = in; 

(ii) the intersection of the sphere r = 4 and the cone (t = in is a circle 
of radius r sin d = 2V2; 

(iii) the plane <p = in and the cone 0 = in intersect in a pair of straight 

lines from the origin; ° 

(iv) a semi-circle radius 2 in the yz-plane centre (0,0,2) on the z-axis. 

15 (l) r = 2, p 2 + z 2 = 4; (ii) r = 9 cos 0, p 2 + z 2 = 9z; 

(iii) z 2 = x 2 + y 2 , 6 = in,e = in; 

(iv) x 2 +y 2 + z 2 = 4z, p 2 + z 2 = 4z. 

EXERCISE 2 (page 45) 

1 u;0. 

2 2; -2. 

3 (i) 2 sec 2 x/(l - tan x) 2 or sec 2 )^ + x ); 

(ii) sin x cos x/VO + sin 2 x); (iii) cos 2x - > sec 2 x. 

4 r ( 7 2) 7 3;when * = °>y = 28, when y = 0,x = -4; point of inflexion 
(-2, 14); y == x 3 for large x. 

5 sin //(1 -f cos t). 

6 [(1 + 20/(1 + OJ 2 . 

7 3. 

8 2y — 9x + 9 = 0. 



530 


MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


9 (2, 4) and (-2, -4). 

10 n = 2k, 3"(—l) k ; n = 2k + 1, 3"- 3 n(n - 1)(« - 2)(-l) fc - 1 

11 maximum (3-7, 6-4) minimum (5-8, 5-0) approximately; common tangent 
y — x + 1 =0; curve crosses x-axis between x = 0 and x = 

12 

h fy f,X fyy f,y = fy x 


1 ly -xly 2 0 2xly 3 -1/y 2 

-x(.x 2 + y2)-3/2 +^,-3/2 ( 2 JC 2 - rbCx 2 + X V 5,2 (2j 2 - X 2 Xx 2 + JlV 6 ' 2 3x>'<> 2 + J 2 ) _5 ' 2 

13 8 2 u/8x 2 = {n(n + 2)x 2 — nr z }/r n+4 

14 Hint; Differentiate each equation implicitly partially with respect to x 
and partially with respect to y. 

— (u + v)/(u 2 + V s ), (2v — 3k/2(m 2 + v 2 ), (v — u)/u 2 + v 2 ), 

(2 u + 3u)/2(« 2 + v 2 ). 


EXERCISE 3 (page 61) 

1 (i)0; since there exists an interval (— h,h ) containing all terms after 
the Mh. 

(ii) no finite limit. 


(0 - 

(ii) i 

(iii) 1 

(iv) 0 

(v) oscillates 

3 (i) 3 - l/3 n -\ C, 3. 

(ii) 1(10 + 5/(-2)"-\ C, 10/3. 

(iii) 3(2" - 1), D. 

4(0 * + £r”# .x*-l; i 


(vi) sequence oscillates 

(vii) 0 
(viii) 0 

(ix) a n ~* co 

(x) «„ - - oo 


5 (i) C, -1 < x < 1, D all other x; 
(ii) C; (iii) C. 

6 (i) C, — 1 < x < 1, D all other x; 

(ii) A.C. for all x. 

(iii) A.C. all x. 

7 (i) D; (ii) C; (iii) C. 

8 (i) D; (ii) D. 

9 (i) 27; (ii) 1; (iii) oo. 

11 (i) — 1 < x < 1; (ii) 0 only. 


EXERCISE 4 (page 76) 

1 (i) 2x exp (x 2 ); (ii) sec x tan x exp (sec x); 

(iii) 3x 2 log (1 + x 3 ) + 3x 5 /(l + x 3 ); (iv) 2x[l + log (1 + x 2 )]; 

(v) tan -1 x; (vi) l/\/(x 2 — 3); 

(vii) 4x cosh (2X 2 ); (viii) log x. 

2 Find log y in each case and differentiate implicitly: 

(i) x x (l + log x); (ii) (sin x) r [log (sin x) + x cot x]; 

(iii) KV*)*(1 + log x) (iv) x^ z (2 + logx)/2Vx; 

(v) x slnl [cos x log x + (sin x)/x]; (vi) 10 loerslnz cot x log 10. 
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3 log (1 + x) + x/( 1 + x); 0; 2; -3. 

4 (i) 2(cosh x cos x - l)/(sinh x - sin x) 2 ; (ii) - sech x tanh x; 

(iii) tanh -1 x; (iv) sech x; (v) 2/sinh 2x. 

5 0 < y < 1, minimum at (0,0), x ± oo, y -*• 1 —. 

6 -e -al [a cosh (pt - bx) + b sinh (pt - foe)]; p e -< “ sinh (pt - bx). 

7 (i) 2 X log 2; (ii) sin 2x exp (sin 2 x); (iii) (a^ 1 log a )/2 Vx; 

(iv) 3 cosh 2 x cosh 4x; 

(v) y'/y = (6x)/(x 2 + 1) - 2/(2x - 1) - 6/(3x + 2); 

(vi) -2/(1 - x 2 ); (vii) (2x - 1) exp (1/x); (viii) o*x° (log a + n/x). 

8 x y \y log x 4” 1); x y (log x) 2 . 

9 minimum (0, 0); maximum [4, (4/e) 4 ]; asx-*oo,j-*0+. 

10 minimum (0, —3). 

12 (i) exp (x sin x)[x cos x + sin x]; (ii) (log x) r [log (log x) + (log x) -1 ] 

(iii) (1 + x + log x)/(l + x) 2 ; (iv) — l/[x(logx) 2 ]; (v) sec x. 

17 (2x + y)/(x - 2y). 

18 x = 1-70, v = 0-597 or x = -0-597, v = -1-70 

19 14-11. 

20 -0-405,0-916. 


EXERCISE 5 (page 98) 

1 Consider the derivative of the difference 1 — e -/i — fle - ^ and of 
P - 1 + e-*. 

2 -2-5% 

4 3 p h + irp a . 

5 (i) i; (ii) (iii) 1; (iv) 2; (v) i; (vi) 2. 

6 (i) i; (ii) 0. 

7 (i) i; (ii) 1; (iii) 2; (iv) — 1/w; (v) J. 

9 ~J' 

10 S (—l) n + 1 x n /», —1 < x < 1. 

(0 ^ + x 3 /3 + x s 15 + . . .; (ii) x + x 3 /6 + x*/24 + . . .. 

11 x - *x 2 + Jx 3 - . . .; 1 - x 2 /2! + x 4 /4! + . . .. 

12 (i) (~l) B x 2B+1 /(2n + 1)1, all x; 

(ii) £ x" sin (0 + inn)/nl, all x; 

(iii) £ («)”/«!, all x; 

(iv) £ x n , | x| < 1; 

(v) £ Ox", |x| < 1. 

13 £ (-l) n + 1 (WYln, |x| < V2. 

14 1+ x + «x 2 - j 2 ). 

15 (i) minimum (1, -2); (ii) saddle point (1,2); (iii) maximum (0, 1). 

17 minimum (0, 0); saddle points (±5, ±6). 

18 minimum (0,0); maxima (0, ± 1). 
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19 minimum (0,0); saddle points (±1,0). 

21 -34/9. 

22 saddle point (0,0); minima (0, ±a). 

28 Hint for last part: evaluate 



29 

30 Slope of tangent is —3. Equation of the tangent is_y = (—3)0 — 4). 

EXERCISE 6 (page 123) 

C and A are arbitrary constants of integration. 

1 3a 2 A 7 /7 + C. 

2 x 3 + 2x 2 + 2x + C. 

3 o 2 a 7 /7 + lab; c 1 + b 2 x + C. 

4 fV(2 ax 3 ) + C. 

5 2Ve x + C. 

6 arcsin (x/V 2 ) — argsinh (x/\/2) + C. 
lx — coth x + C. 

8 C+ 2V/(l + log 2). 

9 C - iV(5 - 4a). 

10 C — a log |a — x|. 

11 x+ ilog|3x+ 1| + C. 

12 lx 2 + x + 2 log \x — 1| + C. 

13 i arctan (3x) + C. 

14 log 11 + cosh x\ + C. 

15 VO 2 + 1) + C. 

16 log |tanx| + C. 

17 C — log [sin a + cos x\. 

18 log 0 + cos jc| + C. 

19 sin x — l sin 3 x + C. 

20 C — cos x + | cos 3 x — l cos 5 x. 

21 l sin 3 x — i sin 5 a + C. 

22 C — l cos 6 4* + l cos 8 4*. 

23 C - 4 cot 8 x. 

24 {C + 2* + 4 sin 2x - 4 sin Ax - i sin 6*}/32 

25 C — cot A — 4 cot 3 A. 

26 C - a + 4 tan 5a. 

27 C + i cos 2a — (cos 8a)/16. 

28 2 log |a + 2| + 5/(a + 2) + C. 

29 4 log U(a + 4) 9 /a]. 

30 log [A(x 2 + 4a + 8)] — 5{arctan(A + 2)/2}/2. 

31 2\/(x 2 + 4a + 8) - 5 log [(a + 2) + V(* 2 + 4a + 8)] + C. 

32 arcsin [(2a + l)/\/5] + C. 

3 arcsin 4(2 — a) — V(4a — a 2 ) + C. 
arcsin (2a — 1) + C. 
arctan e 1 + C. 

4V3 arctan [(a + 2)/V3] + C. 

37 4 log [a 2 /(1 - a 2 )] + C. 

38 log [A(x + 1) 2 (a - 3) 3 ]. 

39 log [a/(a + 1)] - 1/(a + 1) + C. 
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40 i log [(1 + x)l(l - x)] + i arctan x + C. 

41 4 log [A(x + 5) 2 (x - 1). 

42 C + 4x 2 - x + log [x(x + 1)/(jc - 1)]. 

43 4 log 2. 

44 9/128. 

45 log 2. 

46 4. 

47 4- 

48 2^3/9. 

49 in - 1. 

50 x arcsin x + V(1 — x 2 ) + C. 

51 4x 3 arctan x - ix 2 + 4 log (1 + x 2 ) + C. 

52 e xla (ax - a 2 ) + C. 

53 sin x — x cos x + C. 

54 x" + 1 [(log x)l(rt + 1) - 1 /(« + l) 2 ] + C. 

55 f log (4/3). 

56 4(" - 2). 

57 n - 2. 

58 (2 - 3e-")/13. 

59 / n = e'x" - «/,_!; ^(x 3 - 3X 2 + 6 x - 6 ) + C. 

60 13/15 - J*. 

61 24(e* - l)/85; 3(6"- + 1)/13. 

62 (3«r 2 - 16)/64; (60 17 - 149)/225. 

63 w,0. 

64 C+ [arc cos (V2/x)]/V2 if x > V2. 

65 C- log(l + e-*). 

66 (3x 2 - 2) 10 /60 + C. 

67 f(x - 2)V(x + 1) + C. 

68 log [sin x + V0 + sin 2 x)] + C. 

69 -4- 

70 377/16A 5 . 

71 Divergent. 

72 log 2. 

73 2. 

74 Divergent. 

75 Divergent. 

76 77 . 

77 J log | (tan 4x + 2)/(tan 4x — 2)| + C. 

78 C - x + 2/(1 - tan 4x). 

79 C — log |cos x — sin x|. 

80 T7V3/9. 

EXERCISE 7 (page 144) 

1 4. 

2 3 nab IS. 

3 677 a 2 . 

4 77tf 2 . 

5 1677 / 15 . 

8 7t 2 (t7 2 - 6)/48. 

9 <3 2 /l 5; no 3 120. 

10 377 / 8 ; 5/6. 

11 16/3; 377 / 8 , 4t7 2 . 
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12 (2 2n+2i nl)/(2n + 3)(2 n + l)(2n - 1) ... 3; area 256^/2/105; 
centroid (4/3, 7V2/16). 

13 3-n-/4; 4w/5. 

14 (3o/2, ±»r/3). 

15 (3ot 2 /5, 3o//4), 2^ 3 / 4 ; H-na 3 t 5 /5; t = 5/4. 

16 4M/3. 

17 Distant (2a sin a)/(3a) from the centre along the line of symmetry; 
volume (4ira 3 sin 2 a)/3. 

18 7Ma 2 /5. 

19 3Ma 2 /10. 

20 x = j? = 4(a 2 + a6 + 6 2 )/3n(a + 6); x = .p = 2a/w. 

21 (i) 2wa 3 ; (ii) 16wa 3 /15; (iii) 8wa 3 /5. For the area in 1st quadrant. 


EXERCISE 8 (page 182) 

1 i = 4, / = 2; F 42 = a ix b lk c k2 + a i2 b 2k c k2 + a i3 b 3k c k2 and a ix b lk c k2 = 
onb xx c 12 + a xx b x2 c 22 + 041613032 + 041614 C 42 etc. 

2(i) (j jo) <iU> (:J Zl). 

3 (i) (ii) a tj b jk ; (iii) (b lk , b 2k . b nk ); 

(iv) (a f i, a i2 . a in ). The Arth row, ith column of B'A' is b xk a ix + 

b 2k a i2 + . ■ . + b nk a in , i.e. b ik an- 

5 PQ = AD X AAD 2 A = AD X ID 2 A = A(D x D 2 )A 
QP = AD 2 AAD x A = AD 2 ID X A = A(D 2 D X )A 
Diagonal matrices commute. 

6 ( gb-ah ); ( ae — db). 

7 11. 

8 (i) —35 (transposed); (ii) —70 (interchange C 2 and C 3 ; —2C 3 ); 

(iii) -35 (R 2 - R 3 ); (iv) -35 {C x - 4C 3 ); 

(v) 525 (3C 2 - 5C 3 ). 

9 22. 



11 (i) consistent, indeterminate, x 3 = a, 3x 2 = 14 — a, 3x 2 = 2 — a. 

12 (i) consistent, r — 3, x 4 = — a, 12x 3 = 4x 2 = 9a — 6, 4x 2 = 11a + 2; 

(ii) consistent, r = 3, (xi, x 2 , x 3 ) = (— 1,2,7); 

(iii) inconsistent; 

(iv) consistent, r = 3, x 2 = 0, x 4 = a, x x = 6 — 2a, x 3 = 3a — 8. 

13 Multiply each equation by x. System of four equations with three un¬ 
knowns x 3 , x 2 , and x. Condition for consistency is 

a b c 0 
0 o 6 c 
p q r 0 
0 p q r 


EXERCISE 9 (page 238) 

1 Weight, tension and momentum are vectors. 

2 No. No. 
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3 Use property that in any triangle ABC 

AB^AC + CB. 

4 AC =_2a +_2b, DBj= 2b -_2a, CB = -3ab, CA = -AC. 

5 (i) FE =JAC -JAB = J(AC - AB) = \BC. _ 

(ii) Use OD = OB + JBC, OE = OC + \CA, OF= OA + }AB, 
and AB + BC + CA = 0. 

6 R = (AB + AD) + (CB + CD) = 2AQ + 2CQ; 

AQ = AP + PQ; CQ = CP + PQ. 

7 3b - 2a. 

9 G is centroid of ABC. 

10 (i) V101; direction ratios [8: — 1: —6] 

direction cosines 37°15', 95°43', I26°39'. 

(ii) \/158; direction ratios [10:7:3]. 

11 (3, -3, -1) 

12 5; 55°33', 64°54', 45°. 

13 r = (21 - 3j + k) + r(7j + k); (2,27/5,11/5) or (2, -57/5, -1/5). 

14 (x - 1)/—3 = (y - 1)/—2 = (z — l)/2. 

15 (3/7, 37/7,2/7); (11,0,-5). 

16 13(4i + j + 4k)/V33. 

17 -7/V53. 

18 12(4j + 5k)/41. 

19 V(41/2). 

20 VC35/6). 

21 24 units of work. 

a + b = c, a.c = 0. 

Area = |a x b| = 3^10; (5i - 4j - 7k)/3V10; 5x - Ay - Iz = 0. 
(-31 + 5j + llk)IVl55; (161 + 25j - 7k)/ v '6>/155; 

(-491 + 30j - 27k)/ v / 26V155. 

25 —121 + 15 j + 9k 

29 (1,2,3). 

30 3. 

31 9. 

32 -2,3/2. 

33 (2, -1,0). 

34 Planes have a common line of intersection. 

35 Planes form a prism. 

36 (a x b) . a X c = [c X (a x b)]. a and expand the triple vector 

product. _ 

37 Write CB = a, CA = b, AB = c, c = a — b, and form the product 


38 


x y z 1 

*i yi *i i 

-v 2 y-i zi i 

•X3 }'3 Z 3 1 


= 0; 10* + 9y - 3z - 17 


0 . 


39 —sin t i — 2cos2/ j + 2/k; —cos/i + 4 sin 2t j + 2k; 

(cos 2 t + 16 sin 2 2 1 + 4) 1 . 

42 Equation of circle r = 2, r is at right angles to r. 

44 acj 2 radially towards the centre and aw tangentially in direction of 
increasing 0. 
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45 Radial and transverse components of velocity given by (x cos 6 + y sin 0) 
and (y cos 0 — x sin 0) respectively. Similar expressions for acceleration 
components. 

47 i- = a>r cot a ; r6 = mr; constant angle is a. 
f t = co 2 r (cot 2 a — l);/o = 2m 2 r cot a 

48 x = t 2 + l;y = 2/ + 2;z=3 — / + r 3 /6. 

49 (i) A = 3, (4/V17, - l/V^Y 

A = 9, (1/V2, -1/V2) r - 

(ii) A = -1 , (0, 1/a/ 2, -1/V2) r ; A = 0, (1/V2, -1/V2, 0) r ; 

A = 6, (7/V662, 17/V662, 18/V662) T 

50 Characteristic equation A 3 — 27A 2 + 180A — 324 = 0; A = 18, 6, 3; 

A = 18, (2/3, -2/3, 1/3) T ; A = 6, (2/3, 1/3, -2/3) r ; 

A = 3, (1/3, 2/3, 2/3) T . 

51 A = -1, -1, 2; A = 2, (1, 1, l) r ; A = -1, a(l, -1, 0) T + 0(1, 0, -l) r 
Choose two vectors i + j + k and i — j, the third is then I i j k I 

1 11 
|l -1 o| 

i.e. i + j — 2k. Three mutually orthogonal normalised eigenvectors are 
(1/V3, 1/V3, 1/V3) r , (1/V2, - 1/V2, 0) r , and (1/V6, 1/V6, -2/V6) r . 

52 (V(^)i = n x = 2xi + 2y\ + 2zk; (v^) 2 = n 2 = 2xi + 2yj — k. 

n x . n 2 = |n! | |n 2 1 cos 0,i.e. cos 6 = (4x 2 + 4y 2 — 2z)/(2\/14)V(4z + 9) 

when x = 1, y = — 2, z = 3. Hence cos 0 = 1/V6. 

53 4> = PQ 2 = (x - 2 ) 2 + (7 + l ) 2 + (z - l ) 2 
grad * = 2 (x - 2 )i + 2 (y + l)j + 2 (z - l)k. 

grad 4 • a [2(x - 2), 2(y + 1), 2(z - 1)]. [(1, 2, 1)] „ 

|a| VC1+4+1) ' 

54 F = = v (log r ) = (vr)/r = r/r 2 . Work done in small displacement 

is F . dr with dr = —<o sin cosi + co cos wsi + ak. Total work done is 

r l <o 

EXEROSE 10 (page 271) 

1 (i) -d + ic\ (ii) 5 - /; (iii) -i + fi; (iv) -2; (v) (-11 + 20/125; 

(vi) (-3 + 110/5. 

2 (i) (x 4 — 6x 2 y 2 + j 4 ) + /(4x 3 y — 4xy 3 ); 

(ii) [(1 - x 2 - f) + 2 i >]/[(1 - x) 2 + /]; 

(iii) x(2x 2 + 2y 2 + l)/2(x 2 + 7O + »>(2x 2 + 2y 2 - l)/2(x 2 + y 2 ). 

3 R = (PC — LQ)ISQC. 

4 (i) 3e'°; (ii) 3e ,7r ; (iii) V2e l77 ' 4 ; (iv) V2e 3177 ' 4 ; (v) le- <77 ' 6 
(vi) 16e _ 177,3 ; (vii) \/2e‘ 1 ' 7i ; (viii) 16v'2e l7r ' 4 ; (ix) \/6e~ i0 
where 0 = tan _1 (2 — V3) = jt/ 12. 

5 V2(l - 0; —2 15 /. 

6 x = — 1 ± 2i, x = 1 (twice). 

7 e‘ <77,6+,C77 ' 3> , A: = 0,1,2, 3,4, 5. 

8 2-^'2e U277,9 + 2k77,3) , k = 0,1,2. 

9 z, = e ‘or/ 5 + 2 '«r/5>, k = 0,\, 2, 3,4. 

Evaluate (z - z 2 )[(z - z 0 )(z - z 4 )][(z - z a )(z - z 3 )]. 

14 Second equation gives x = k* or y = 0; if y = 0, the first equation 
gives cos x = — 3 which is impossible; if x = kn, the first equation gives 
coshy = 3 when k = 2r + 1 (k and r integers). 
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15 (i) line x = 2; (ii) line y = — x; 

(iii) interior of wedge 0 < 0 < tt/ 3; (iv) interior of circle x 2 + y 2 = 1 ; 

(v) circumference of circle centre (3, 0) radius 2; 

(vi) exterior of circle centre (0, —3) radius 4; 

(vii) circumference of circle x 2 + y 2 = 2; 

(viii) circumference of circle (x — 2) 2 + y 2 = 2; 

(ix) interior of annulus x 2 + y 2 = 1; x 2 + y 2 = 4 between the radii 
6 = tt/4; & — tt/ 2; boundary, except line 0 = w/4, is included. 

16 (i) analytic on line x = 0; 

(ii) analytic everywhere; f\z ) = 2 + 3i; 

(iii) analytic everywhere except at z = 0;/'(z) = —{cos (l/z)}/z 2 ; 

(iv) not analytic. 


17 Write — 
or 


u . 8u 8u . . 8i 

x + smd ^ro = - rsmd a; 


and use Cauchy-Riemann equations. 

IS a = 2 = d; b = —1 = c. (Cauchy-Riemann equations to be true every¬ 
where.) 

19 Real axisy = 0, —oo < x < —1 maps into real axis t> = 0, + oo > « > 0 
and y = 0, — l<x<+co maps into real axis v = 0, 0 < u < +co. 
Imaginary axis x = 0, — oo < y < 4- °o maps into parabola v 2 = 4(1 — «). 
Semicircle maps the region interior to quadrant of circle 


(x+ l) 2 4-y 2 = 1, x> 0,y > 0. 

20 AB: line v = 0 from u = 3 to u = 2. 

BC: circle (m — f) 2 + v 2 = £, v < 0, from u = 2 to u — If. 

AC: circle (« — 2) 2 + (v — l) 2 = 2, v < 0, from u = 3 to u = 1|. 

21 Equation of curve in w-plane is p = c** where w = pe'*. 

22 v = x 3 — 3xy 2 + constant; w = iz 3 + constant. 

23 Slope of curve u(x,y) = constant, is m 1 = (—(0i//0x)/(5«/0y), and slope 
of curve v(x,y) = constant, is m 2 = — (Su/0x)/(5y/0y). Use Cauchy- 
Riemann equations to show that = — 1. 

(i) u — const, are rectangular hyperbolae x 2 — y 2 = A 2 and 
v = const, are also rectangular hyperbolae xy = n. 

(ii) « = const, are systems of coaxial circles (x — A) 2 + y 2 = A 2 and 

v — const, are also systems of coaxial circles x 2 -f (y — n) 3 = a* 2 . 

(iii) u = const, are systems of hyperbolae x 2 /sin 2 u — y 2 /cos 2 u — 1 and 
v = const, are systems of ellipses x 2 /cosh 2 v + y 2 /sinh 2 v = 1. 

24 v = y - x 3 + 3xy 2 + C; w = z - iz 3 + »C. 


EXERCISE 11 (page 305) 

A, B, C and E are arbitrary constants. 

1 (i) y = Ae 3xis ; (ii) y = log (A + e 1 ); 

(iii) 4V(* 3 + 1) + 3 log (y 2 + 1) = C; 

(iv) 4y 3 = [3 V(2x) 4- C] 2 ; (v) 2y + sinh 2y = 4 cos x + C; 

(vi) 21/(4 - y 2 ) = V(2x 2 + 3) + C. 

2 (i) 1 + c~ y = ^e COSI ; (ii) 3y — 3 log |1 + y| = x 3 -f A; 

(iii) e J (l — e") = 1 — e; (iv) 4 — y 2 = A(l — x) 2 . 

3 (i) y = (x + l) 2 — 16(x + l) -2 ; (ii) y = 3 sin x cos x + A cos 2 x; 
(iii) y log x = x log x — x + C; (iv) y = 1 + A cos x; 

(v) y = 2x cos x — sin x + /( cos x. 
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6 

7 


8 

9 


10 


11 

12 


13 

14 

15 

16 

17 

18 

19 

20 

21 


22 


23 


24 


(ii) y = A cosec x + 2 cot x + 2x — x 2 cot x; 

(iii) y = ix 3 + Ax~ 1 . 

(i) xy = Ae xlv ; (ii) (y — x) 3 = Ax(2y — x) 3 ; 

(iii) y = Ax(y + x); (iv) x(3y 2 + x 2 ) = C; 

(v) y = Cx. 

(i) lx 2 + xy + iy 3 = C; (ii) xV + e* + 6* = C; 

(iii) x 2 y + xy 2 - if = C. 

(i) X 2 + y 2 = 1; (ii) y = (arcsin x) 2 + A arcsin x + fi; 

(iii) k 2 y = B — kx + log (A + e 2kx ). 
v 2 = K2kx - \)/(2k 2 ). 

(i) A<r 2t + fie 51 ; (ii) A + fie -31 ; 

(iii) (A + Bx)e~ 2x ; (iv) e 2x (A cos 3x + fi sin 3x); 

(v) e mx (A cos mx + fi sin mx) + e" mx (C cos mx + E sin nix). 

(i) (a) y = A(e~ x - e~ 3x ), (b) y = i(3<r x - c~ 3x ); 

(ii) (a) y = Axe 1 , (b) y = (1 - x)e r . 

(i) (9x 2 - 24x + 26)/27; (ii) i(3x 2 + 8x). 

(i) e~ x (A cos 2x + fi sin 2x) + (lOx + 16)/25; 

(ii) e 2x (Ax + B) + Jx 2 + ix + f; 

(iii) Ae~ x + Be 1(5 - 2x 2 + 16x - 84; 

(iv) A cos (xV3) + fi sin (xV3) + C + Ex - x 2 (4 - x 2 )/36. 

(i) ie 1 ; (ii) f xe' 1 ; (iii) JxV; 

(iv) ie~ x (2x - 3); (v) (25x 2 - 20x - 2)e x /l25; 

(vi) ixe 31 . 

(i) e~ x (A cos 2x + B sin 2x) + ie*(3 + 2x); 

(ii) A + Be' 21 + e 2l (8x 2 - 12x + 7)/64; 

(iii) A + e x (B + Cx - ix 2 + ix 3 ). 

(i) — i cos x + f sin x; (ii) ix sin 3x; 

(iii) — e* sin x; (iv) —ixe 1 cos x; (v) — Jx 2 sin x. 

(i) A cos 3x + fi sin 3x + f cos 2x; 

(ii) Ae~ ix + (4 cos x + sin x)/17; 

(iii) Ae~ 2x + Be~ 3x + (cos x + sin x)/10 - (5 cos 2x - sin 2x)/52. 

(i) x = 2te 2 ‘; (ii) x = 2 sin / — sin 2 1. 

(i) y = ixe- 3l (x - 2); (ii) y = Ae~ x (1 + x). 

(i) e*(l - ix + ix 3 ); (ii) e* - e“ 3 *. 

(i) 4e _I (x — 1) + 4e' 21 ; 

(ii) [e“ z (18 cos 3x + 40 sin 3x) + 3 sin 3x — 18 cos 3x]/l 11. 

(i) (Ax + B)e x + x 3 + 6x 2 + 18x + 24; 

(ii) e 5z (A cos 2x + B sin 2x) + e x (l 1 cos 3x — 24 sin 3x)/697; 

(iii) (A + Bx)c ~ ll ; (iv) e~ x (Ax + fi) + x — 1; 

(v) Ae~ x + fie -21 + ie 1 ; 

(vi) (Ax + fi) cos 2x + (Cx + E) sin 2x. 

(i) Ae x + fie 41 — (4 cos 2x — 3 sin 2x)/25; 

(ii) Ae~ x + Be 31 ; (iii) A + Be * x - (2 cos 2x + sin 2x)/10; 

(iv) Ae~ 2xl5 + e I (49x 3 - 105x 2 + 150x + fi)/l 029; 

(v) e 3x (A cos x + fi sin x) + e I (5x sin x + lOx cos x + 2 sin x + 

14 cos x)/25 + (25x 2 + 40x + 22)/125. 

Isoclines y = (1 — m)x. Solution curves are hyperbolae with asymptotes 
x = 0 and y = ix. 

Solution obtained from v'x = 1 where v = y/x. Isoclines are straight lines 
y — (m — l)x. Solution curve through (1,0) is y = x log x. As x -*• 0, 
y 0, y' -»■ oo; Turning point at (1/e, — 1/e); x -» co, y -*■ oo. 
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25 Isoclines are (x — m/2) 2 + y 2 = (m/2) 2 where m is slope on solution 
curves. These are circles through the origin (0,0) centres (m/2,0), radius 
m/2. 

EXERCISE 12 (page 320) 

1 (i) 3x 5 + 5x i + 10x 3 + 24x 2 + 48x + 96, remainder 191; 

(ii) a 5 + 4x 2 /3 + Ax 19 + 4/27, remainder -23/27. 

2 Function has zero value at x = 1, quotient is 4 a 3 — 2x 2 + 2x — 1 
which vanishes at x = i, final quotient is 4x 2 + 2. Factorization is 
(x - l)(2x - l)(2x 2 + 1). 

3 3jc 3 + 1/(jc — 3) — 4 l(x - 3) 2 + 5/(x - 3) 4 . 

4 (i) Ax* + 8x 3 — 3x 2 — 6x — 21, remainder —24x + 76; 

(ii) 2x 4 — 3x 3 + 2x 2 + 7-5x — 19-25, remainder 65-25x — 6-25. 

5 Roots of given equation are a, ft, and y and a + ft + y = o, a/S + ay + 
py = 2/3, afty = 1/3. Roots of new equation are a 2 , ft 2 , and y 2 and 

a 2 + ft 2 + y 2 — (a + ft + y) a — 2(a/S + a y + fty) = —4/3, 
a 2 ^ 2 + a 2 -/ 2 + ft 2 y 2 = (aft + a y + (ty) 2 — 2afty(a + 0 + y) = 4/9 
* 2 /?V = (a Py) 2 = 1/9. 

New equation is 9x 3 + I2x 2 + Ax — 1 = 0. 

6 Roots a, p, and y then « + p + y = 5/3, and a + p = 2. Hence y = 
-1/3. Roots are -1/3, 3/2, and 1/2. 

7 x = 3-5, (7 ± *yil)/6. 

8 f(x) has one variation, i.e. not more than one positive real root; f(—x) 
has one variation, i.e. not more than one negative real root. But/(—oo) > 
0, /(0) < 0, and /(+ oo) > 0 hence there is not less than one positive 
real root and one negative real root. There is exactly one positive and 
one negative real root, and hence two complex roots. 

9 x = 2-10, 0T5, and -3-25. (Check: sum of roots is —1). 

10 r 3 cos 3 0 - 3r cos 6 — 1 with r 3 :3r = 4:3, i.e. r = 2 and 4 cos 3 0 - 
3 cos 0 = cos 30 = 1/2. 

Roots of reduced equation 

30 = 2nir ± ir/3, « = 0,1,2,. . . 

Roots required 

xj. = 2 cos *19 = 1-879 4 

* 2 = 2 cos (2*/3 + *19) = -1-532 1 

x 3 = 2 cos ( 2*/3 - */9) = -0-347 3. 

Other values of n repeat the above roots. 

11 cos 30 = 2 with 6 = a + iP leads to 

a = 2n*l3, P = (cosh- 1 2)/3 = 0-439, n = 0,1,2,. . . 

Roots are x n = 2 cos 0 = 2(cos a cosh P — i sin a sinh P), n = 0, 1, 2. 

Xj = 2-196; x 2 = -1-098 + /0-785; 

*3 = -1-098 - /0-785. 

EXERCISE 13 (page 352) 

1 0-34. 

2 0-38, -0-65. 

3 0-3376. 

4 -0-374. 
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5 

6 
7 


9 


10 


11 

12 

13 


15 


16 


2-646. 


Use Newton-Raphson method with f(x) = x 3 — a, 2-154. 

0-125 7. 

0-347 3; -1-879 4. 

First estimates: x — 4-237,/ = —0-853, z = —2-508; 

First residuals: —0-000 2, +0-000 8, —0-000 1. 

Second estimates: x = 4-236 9, / = -0-852 7, z = —2-507 9; 

Second residuals: -0-00001, -0-000 04, +0-000 04. 

(i) x = 4-236 9, / = —0-851 5, z = —2-508 2 in six iterations starting 
from (0,0,0); 

(ii) x = 4-236 8, / = —0-852 6, z = -2-507 9, correct to four decimal 
places starting from (0,0,0). 

0-339, —0-041, +0-638, —0-540 correct to three decimal places in four 
iterations starting from (0,0,0). 
x 0 0-2 0-4 0-6 0-8 1 0 

/ 1 0-8 0-67 0-58 0-51 0-46 

Solution by separation of variables is/ -1 = x + 1 and j(l-0) = 0-5. 
x 0 0-1 0-2 0-3 0-4 

/ 1 1-116 1-275 1-498 1-824 

/(0-4) 1-82. 


x 0 0-2 0-4 0-6 0-8 1-0 

/ 1 0-856 0-787 0-770 0-791 0-840 

Minimum occurs at x = 0-58, y — 0-76, approximately. 

Maximum error 0-04 (approximately). Answer is 1-8 correct to one decimal 
place. 

Absolute error in x is ±0-005, absolute error in / is (approx.) ±0-0028 
and / = 0-2 correct to one decimal place. 


EXERCISE 14 (page 392) 

1 Third differences constant at 0-750. Overlapping errors. Corrected values 
y e = 438-976, /„ = 636-056. 

2 /(0) = 1-40, *o = 0, h = 1, /(0 + ul) = 1-40 + 0-64 m + 0-16«(« - 1) 
hence 100/(x) = 104 + 4(2x + 3) 2 . 

3 Use /(xo + uh) with x 0 = 0, h = 2 then replace 2 u by x, 

8 f{x) = 568 + 200x + 24x 2 + x 3 . 

4 /(16-4) = 267-6 and /(23-5) = 408-9. 

5 sinh 4-612 = 50-337(6) in each case. Same result because A 2 / is constant 
over this range and sinh x is represented by a quadratic polynomial. 

6 /(0 3) = 0-295 7 (Newton-Gregory forward); 

/(2-55) = 1-415 0 (Newton-Gregory backward); 

/(1-75) = 1-226 8 (Bessel interpolating to halves); 

/(1-55) = 1-152 6 (Stirling using up to fourth differences). 

7 x 0 = 40, h = 10, leads to iteration equation 

u = 0-588 235 - 0-103 387 u 2 + 0 005 348« 3 
and u = 0-556 2, x = 45-562° or 45° 34'. 

8 /'(14) = 0-019 92,/"(l2) = -0 000 04. 

9 Maximum occurs when x = 3-336 and/(x) = 1 -226 4. 

10 (i) 0-341 277, error estimate +0-000 27; 

(ii) 0-341 508, error estimate —0-000 007; 

(iii) 0-341 505, error estimate +0 000 001. 
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EXERCISE 15 (page 430) 

1 (0 I; GO b 

2 (0 t, (ii) i; (iii) ts; (iv) b 

3 (4 !)(48 !)/(52 !)(13 4 ) = 1/270 725. 

4 1 023/1 024. 

5 (i) 5 2 /6 3 ; (ii) 5 n-1 /6 n . 

6 (i) 1/18; (ii) 1/9; (iii) 1/12. 

7 (i) 9/20; (ii) 1/20; (iii) f. 

8 9/17, 8/17. 

9 b 

10 (i) 1/10; (ii) |; (iii) 3/10; (iv) i; (v) b 

11 (i) 1; (ii) 1 - e~*(l + x), x > 0; (iii) 0-264; (iv) 0-33; (v) 2, 2. 

12 (i) c = 1; (ii) x < 0, F(x) = 0; 0 < x < 1, F(x) = ix 2 ; 

1 <x < 2, F\x) = 2x - ix 2 - 1; x > 2, F(x) = 1. 

13 2-527, 0-100 4. 

14 (i) mean decreased by 10, standard deviation unchanged; 

(ii) new mean 90% of old mean, new s.d. 90% of old s.d. 

15 (i)/= 0-031 25, 0-156 25,0-312 5; 

(ii) /= 0-001, 0-010, 0 044, 0-117, 0-205, 0-246; 

(iii) zero until r = 5 when/ = 0-002, 0-005, 0-014, 0-032, 0-061, 0-098, 
0-133, 0-155. All three histograms symmetrical. 

16 /= 243, 405, 270, 90, 15, 1. 

17 (i) (a) 60-5, (b) 60-7; (ii) (a) 9-0, (b) 9-0. 

18 16-53, 29-75, 26-78, 16-07, 7-23, 2-60. 

19 (i) 0-049 8; (ii) 0-005. 

20 42-8. 

21 (i) 50; (ii) 68-26; (iii) 95. 

22 1 002-635, 3-55 P{x < 990} = 0 000 19. 

P{x > 1 010} = 0-019. 


EXERCISE 16 (page 460) 

1 (i) 0-37; (ii) 0-091 2. 

2 X = 1 -997, machine should be adjusted. 

3 \x\ = 7-298, mean has changed. 

4 X = 2-67, significant difference, firm’s claim discredited. 

5 2 655. 

6 1= 1 -677, there is no reason to doubt that the entry requirements are 
the same; X = 0-833, no conclusion. 

7 Not necessarily, X = 2-236 (i) significant, (ii) not significant. 

8 t = 2-897, significant difference. 

9 X = 1-284, no evidence that men are better drivers. 

10 t — 2-74, significant, sleeping drug is effective. 

11 t = 1-51, not significant, coaching not effective from data. 

12 t = 2-23, not significant. 

13 t = 0-73, there is no reason to doubt that the rods come from the same 
population. 

14 (i) 0-444, 0-391, 0-138, 0-024, 0-002, 0-000, . . .; 

(ii) 0-472, 0-354, 0-133, 0-033, 0-006, 0-001; 

(iii) B.D. x 2 = 38-74, P.D. x 2 = 30-48, increase is significant in both 
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15 x 2 — 3-56 with 4 degrees of freedom, fit satisfactory. 

16 x 2 = 46-79 with 6 degrees of freedom, significant, larger canteens produce 
cheaper meals. 

17 x 2 = 1 '87 with 4 degrees of freedom, fit satisfactory. 

18 x 2 very large with 12 degrees of freedom, manual dexterity and success 
in examinations are not independent. 


Index 


Abscissa, 13, 322 

Absolute, convergence, 51, 254, 255 
error, 344 
error modulus, 345 
frequency, 406 
magnitude, 4 
maximum, 90 
value, 4 

A.C. bridge, 509 
Acceleration, 225 
radial, 226 
transverse, 226 
Accuracy, 346 

Addition, formulae, 71, 263, 397 
of complex numbers, 247 
of directed magnitudes, 186 
of matrices, 150 
of vectors, 188 
theorem of statistics, 397 
Adjugate or adjoint matrix, 168 
Admittance, 506 

Algebra, of complex numbers, 247 
of matrices, 150 
of vectors, 188 

Algebraic equations, 146, 174, 178, 
261, 310 

Alternating current, 500 
Alternating series, 58 
Amplitude, 251, 292 
Analogue computer, 491 
Analytic functions, 226 
Angular momentum, 217 
Angular speed, 226 
Approximate, differentiation, 380 
integration, 382 

solution of differential equation, 
338 

solution of equation, 323 
Approximation, 33, 81 
to roots, (Ch. 13), 321, 378 
Arbitrary constant, 102 
Area, as integral, 128 
as limit, 126 
by integration, 108, 109 
definition, 127 
in polar coordinates, 130 
under curve, 109, 129 


Argand diagram, 246, 250, 253, 259, 
B 260, 266 

Argument, 1, 25, 354 
Array, 147 

Associative law, 151, 155, 185, 192, 
207 

Asymptote, 472 
Attribute, 395, 443 
Augmented matrix, 160, 172, 335 
Axis, of imaginary numbers, 246 
of real numbers, 1, 246 
Backward interpolation, 364 
Balance, equation, 511 
bridge, 510 
Base vector, 496 
Bending of beam, 299 
Bessel, 366, 372 

Bessel’s interpolation formula, 372, 
373, 374, 377, 381, 382 
Binary quadratic, 92 
Binomial, coefficients, 31, 361, 367 
distribution, 417, 422, 427, 444 
frequency function, 417 
Boundary conditions, 478 
Bounded sequence, 49 
Cartesian, axes, 194, 207 
coordinates, 13, 131,193,196, 202, 
250 

Catenary, 472 

Cauchy mean value theorem, 81 
Cauchy-Riemann equations, 267 
Capacitative network, 504 
Central limit theorem, 439 
Centre of mass, 134, 135, 136, 137 
Centripetal acceleration, 227 
Centroid, 134, 137, 198 
Chain rule of differentiation, 95 
Chi-square distribution, 451 
Circular functions, 299, 472 
Class interval, 406 
Coefficient matrix, 333, 487 
Co-factor, 162, 163 
Combinations, 399 
Commutative law, 151, 158, 185,187, 
207, 209, 210, 248 
Comparison test, 55 
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Complementary, event, 400 
function, 287 
Complete primitive, 287 
Complex, current, 501 
functions, 263 

numbers, (Ch. 10) 244, 256, 500 
roots, 258, 261, 319 
variable, (Ch. 10) 263 
voltage, 502 

Component vectors, 192, 193 
Concavity, 34 

Conditional, convergence, 51 
probability, 401 
Conduction, 477 
Confidence interval, 438, 448 
Conic section, 471 
Conjugate complex numbers, 248 
Consistent system, 176 
Contingency table, 454, 455 
Continuity equation, 526 
Continuous, function, 4, 22 
variable 2, 405, 406 
Convection, 477 

Convergence, (Ch. 3), 47, 51,60, 321, 
327, 329, 335 
absolute, 51 
conditional, 51 
radius of, 60 
tests, 55 

Convergent integral, 121 
Cooling fin, 475 
Coordinates, 13, 14, 15, 16 
Cosine series, 88, 254 
Cramer’s rule, 172, 174, 175 
Critical damping, 290 
Cross product, 209 
Crystal, 496 
axes, 496 

Crystallography, 496 
Current, complex, 501 
cyclic, 479, 492, 509 
matrix, 480 

network, 479, 490, 492, 500 
response, 501 
Curve sketching, 34 
Cyclic current, 479 
Cycloid, 132 

Cylindrical polar coordinates, 14 

D-operator, 22, 285 
Damped oscillations, 290 


Damping factor, 290 

Definite integrals, 107 

Degree of differential equation, 274 

Del, V, 235 

De Moivre’s theorem, 256 
Dependent variable, 1 
Derivative, 20, 22, 41 
directional, 233, 236 
material, 524 
of complex function, 266 
of vector, 221, 524 
Descartes rule of signs, 319 
Detached coefficients, 311, 312 
Determinants, (Ch. 8), 148, 160-68, 
487 

Deviation, mean, 411 
standard, 411 
Diagonal, matrix, 158 
terms, 335 

Difference table, 354, 357 
Differences, 355 
Differentiable function, 20, 22 
Differential, 34 
coefficient, 20 

Differential equations, (Ch. 11), 63, 
102, 274, 511 

approximate solution of, 338 
Euler method, 337, 340 
numerical solution of, 336 
Differentiation, (Ch. 2), 20 
chain rule of, 95 
following the motion, 525 
logarithmic, 69 
notation, 21 
numerical, 380 
of complex function, 266 
of exponential function, 67 
of implicit function, 28 
of inverse function, 38, 39, 75 
of logarithmic function, 67 
of power series, 60 
of vector, 221 
parametric, 27 
partial, 41, 93 
rules, 23 
total, 94 

Diminishing of roots, 318, 325 
Directed magnitude, 185 
Direction, angles, 195 
cosines, 195, 482 
ratios, 196, 497 
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Directional derivative, 233, 236 
Discrimination of maxima, 36, 37,92 
Dispersion, 411, 412 
Displacement, 486, 520 
gradient, 520 
in elasticity, 519 
rate of, 520 
vector, 188, 496 

Distribution, binomial, 417,422,427, 
444 

Chi-square, 451 
function, 405 
Gaussian, 424 
normal, 424, 427, 434, 436 
of mean, 434 

Poisson, 415, 420, 422, 453 
standardized normal, 425 
student-?, 447 

Distributive law, 155, 185, 192, 207, 
210 

Divided difference, 357 
Divergence, 51, 321, 327, 330, 335 
Divergent, integral, 121 
series, 51 

Division of complex numbers, 249, 
251 

D-notation, 285 
Dot-product, 206 

e, 51, 64 

Eigenvalue, 227, 228, 479, 484, 487 
Eigenvector, 227, 228, 479, 484,487 
normalised, 229 
orthogonal, 231 
Elasticity, 519 
Electrical network, 146, 299 
Element, 147 
Elementary event, 396 
Ellipse, 471 
Ellipsoid, 133 
Entry, 354 

Equality, of complex numbers, 247 
of matrices, 150 
of vectors, 188 

Error, 81, 322, 333, 344, 357, 381, 
387, 391 
absolute, 345 
in iterative process, 350 
interval, 346 
modulus, 345 
percentage, 346 


Error— contd. 
propagation of, 346, 358 
relative, 346 
round-off, 344, 378, 391 
Estimator, 434 

Euler form of complex number, 256, 
260 

Euler method, 337 
improved, 340 
Even functions, 9 
Event, 395 

Exact differential equation, 279 
Exclusive events, 395 
Exhaustive events, 396 
Expansion in power series, 86 
Experiment, 395 

Exponential function, (Ch. 4) 63, 65, 
68, 254, 295, 299, 354,361,363, 
468 

limits, 66 
series, 64, 88, 254 
Extension of spring, 299, 485 
Extreme value, 91 

Factorial function, 363 
Factors, of linear operator, 286 
of polynomial, 313 
Factor theorem, 311 
False position rule, 323 
Filter, 490 
band-pass, 495 
high-pass, 495 
low-pass, 494 
First, difference, 355 
moment, 135, 144, 412 
order differential equations, 275, 
280, 337, 468, 513, 514 
Fluid dynamics, 524 
Forward interpolation, 359, 360 
Frame network, 480 
Freedom equations, 202 
Frequency function, 403 
Function, of complex variable, 263 
of one variable, 1 
of three or more variables, 43 
of two variables, 16, 89 

Gamma function, 363 
Gauss, 172, 333, 366 
interpolation formulae, 368, 371, 
372, 387 
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Gaussian, distribution, 424 
elimination method, 172, 230, 333, 
352, 480 

Gauss-Jacobi method, 335, 336, 352 
Gauss-Seidel method, 336, 352 
Generalized factorial function, 363 
Geometric series, 2, 52 
Gradient, 45 
of scalar field, 232, 526 
Gregory, integration formula, 383 
interpolation formula, 360, 364 

Harmonic, conjugate functions, 
269 

functions, 269, 273 
series, 53 
Helix, 242 

Higher derivatives, 22, 42 
Histogram, 406 

Homogeneous, differential equation, 
277 

system 175, 177 
Hooke’s law, 485 
Hyperbola, 471 
rectangular, 472 

Hyperbolic functions, (Ch. 4), 70, 
262, 468, 471 
cosine, 471 
sine, 471 

i AS OPERATOR, 244 
Identity matrix, 57 
Ill-conditioned equation, 332, 334 
Imaginary, axis, 244, 503 
part, 247, 297, 510 
Impedance, 502 
Implicit function, 28 
Improper integral, 122 
Improved Euler method, 340 
Inconsistent equations, 175, 213 
Increment, 21, 31, 93 
Indefinite, integral, 102 
quadratic, 92 
Independent variables, 1 
function of, 16, 41, 43 
Indeterminate, expression, 82, 175 
solution, 175, 213 
Individual, 395 
Inductive network, 504 
Inequalities, 17, 254 


Infinite, integrals, 121-123 
limits of integration, 121 
sequence, 47 
series, 51, 52 
Inflexion, 37 
Inflexional tangent, 37 
Initial conditions, 488, 515 
Integral, standard forms, 102 
Integrand, 106, 110 
Integrating factor, 280, 282 
Integration, (Ch. 6) 
applications of, 127 
as sum limit, 126 
Integration, by parts, 115 
by quadrature, 382 
by reduction formula, 117 
by substitution, 119, 120 
graphical, 299 
numerical, 382 

of function with irrational denomi¬ 
nator, 114 
of power series, 61 
of rational function, 110-114 
rules, 104 

Interpolation, 323, 354, 359, 364, 373 
formulae, 371 
paths, 365, 366, 372 
Interquartile range, 411 
Intersection of planes, 213 
Interval, 1, 438 
Inverse, derivative, 38 
function, 9 

hyperbolic functions, 74, 75 
interpolation, 378 
matrix, 158, 170, 173, 480 
square law, 236 

trigonometric functions, 11, 12, 39 
Irrational number, 2 
Isocline, 299, 305 

Iteration, 321, 326, 329, 334, 335, 378 
error, 350 

j operator, 244, 501 
Jacobi, 335, 336 

Kronecker delta, 167 

Lag, 504 

Lagrange form of remainder, 86 
Laplace’s equation, 269 
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Latent roots, 228 
Lattice, crystal, 496 
reciprocal, 498 
Lead, 504 

Leibnitz' theorem, 31 
Level, curve, 232 
surface, 235 
rHopital's rule, 82 
Limit, 4, 48, 66, 82, 126 
Limits of integration, 108, 121 
Linear, algebraic equations, 146 
dependence, 176 
differential equation, 285 
independence, 175, 178 
interpolation, 323, 354 
transformation, 147 
operator, 285 
programming, 464 
Linked, circuit, 492 
network, 485 
Local maximum, 91 
Location of roots, 318, 321 
Logarithmic, differentiation, 69 
function, (Ch. 4) 66, 69 
limits, 66 
series, 87 

Lozenge diagram, 366, 371, 377 

Maclaurin expansion, 86, 470 
Management problem, 464 
Mapping function, 264 
Mass, balance equation, 513, 514 
conservation, 526 
Matrices, (Ch. 8) 146, 227, 479 
Matrix, algebra, 150, 333 
cost, 467 

form of vector product, 211 
strain, 524 
symmetric, 230 
Maximum, 36, 90, 381 
Mean, 410, 414, 418, 421, 425, 434, 
436, 440, 447, 449 
centre, 135 
deviation, 411 

value theorem, (Ch. 5), 79, 80, 81 
Median, 410 

Miller indices, 497, 498, 500 
Minimum, 36, 90, 381 
Minor, 162 

Mixed derivative, 43, 44 
Mode, 410 
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Model, mathematical, 407, 433, 463, 
467 

Modulus, 4, 251 
of elasticity, 485 
Moment, first, 135, 144, 412 
of force, 206, 216 
of inertia, 140 
of momentum, 217 
of vectors, 216 
second, 140, 144, 413 
Monotonic, function, 9, 34 
sequence, 49, 59 
Multiple roots, 311 
Multiplication, of complex numbers, 
248, 251 

of matrices, 149, 152, 480, 484 
theorem of statistics, 402 
Mutually, exhaustive, 396, 397 
exclusive, 395, 397 

Nabla, V, 235 
Natural, frequency, 488, 493 
growth law, 468 
Negative definitive, 92 
Neighbourhood, 36, 91 
Network, electrical, 479, 490, 500 
frame, 480 
vibration, 484 

Newton-Gregory, 359, 360, 362, 364, 
366, 371, 376, 377, 380, 383 
Newton-Raphson, 326, 328,331,332, 
351, 352 

Newton’s, law of cooling, 468, 477 
method, 323, 351 

second law of motion, 276, 485, 
526 

Non-commutative, 154 
Non-singular matrix, 173 
Normal, distribution, 424, 427, 434, 
436 

mode, 486 
to curve, 234 
to plane, 212, 498 
to surface, 235, 242 
Null hypothesis, 436 
Numbers, complex, (Ch. 10), 244 
imaginary, 244, 246 
irrational, 2 
rational, 2 
real, 2, 244, 246 
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Numerical, differentiation, 380 
integration, 382 

solution of differential equation, 
336 

Odd function, 9 
Oil purifier, 512 
Operator, 22 
D, 285, 518 

i, 244 

j, 501 
V, 235 

Order, of differential equation, 274 
of matrix, 147 
Ordered array, 147 
Ordinary differential equations, (Ch. 

11)274 
Ordinate, 13 
Orthogonal, curves, 273 
eigenvector, 231 
matrices, 201 
vectors, 193, 247 

Oscillating, network, 299, 484, 500 
sequence, 49, 327 
Oscillations, damped, 290, 491 
Overdamping, 290 
Pappus’ theorem, 139 
Parabola, 386, 471 
Parallel axis theorem, 143 
Parallelogram law of addition, 187 
Parameter, 27, 435 
Parametric equations, 202 
Partial, derivative, 41 
differential equation, 269 
differentiation, (Ch. 5), 79, 93 
fractions, 110, 314 
sum, 51, 54 

Particular integral, 287, 292 
Permutations, 398 
Perpendicular axis theorem, 144 
Plane, 212 

Poisson distribution, 420, 422, 453 
Polar, acceleration, 226 
coordinates, 14, 130 
form of complex number, 250 
Pole, 14 

Polynomial, 289, 293, 310, 313, 315, 
316, 354 

approximation, 359, 374, 380, 383 
Population, 395 
growth, 468 


Position, ratio, 204 
vector, 193, 496 
Positive definite, 92 
Power series, 59, 86 
Predictor-corrector method, 341 
Primitive, 102, 287 
Probability, 390, 396, 401, 436 
density function, 407 
Propagation of errors, 346, 358 
Pulsatance, 485 
Purifier, oil, 512 

Quadratic form, 92 
Quadrature, 382, 383 
Quotient, 310 

Radius, of convergence, 60 
of gyration, 140 
Random, sample, 433 
variable, 395, 403, 405 
Range, 1, 411 
Rate of change, 34 
Rate coefficient, 514 
Rational function, 112 
number, 2, 29 
Ratio test, 56 
Reaction constant, 514 
chemical, 514 
first order, 516 
second order, 516 
Real, axis, 246, 503 
number, 2 

part of complex number, 247, 297, 
501, 510 

Reciprocal lattice, 498 
Reciprocal matrix, 158, 170 
Reduction formula, 117 
Redundant equation, 231 
Regular, solution, 174 
system, 171 

Relation between roots of equation, 
317 

Relative, error, 346 
error modulus, 348 
frequency, 406 
maximum, 91 
velocity, 190 
Remainder, 85-92 
theorem, 310 
of series, 54, 86 
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Repeated, differentiation, 30 
division, 314 
Residual, 332, 333, 334 
Resisted motion, 290 
Resonance, 506 
Resultant, 188 
Revolution, solid of, 133 
Rolle’s theorem, 79, 81, 319 
Roots, of algebraic equation, 261, 
311, 316, 318 

of complex number, 258, 260 
Rotating axes, 148 
Round-off errors, 333 
Rule, of false position, 323 
of Sarrus, 164 
of signs, Descartes, 319 
Rules, for differentiation, 23 
for inequalities, 17 
for integration, 104 
for interpolation, 367, 369, 375 

Saddle point, 92 
Sample, 395, 433 
attribute, 443 
mean, 434 
tests, 436, 439, 446 
variance, 436, 451 
Sarrus’ rule, 164 
Scalar, 151, 185 
components, 193 
field, 232 
matrix, 158 
multiplication, 15 
product, 206 
three-dimensional, 235 
two-dimensional, 232 
Second, derivative, 22, 23, 97 
difference, 355 
moment, 140, 144, 413 
order differential equation, 283 
Seidel, 336 

Semi-definite quadratic, 92 
Separation of variables, 276, 517 
Sequence, 47 
Series, (Ch. 3), 51, 59 
Shear strain, 523 

Shortest distance between lines, 215 
Significance tests, 436, 440, 444, 447 
Significant figures, 344 
Simple, harmonic motion, 285 
sample, 433 


Simpson’s rule, 385, 387, 389. 390, 
426 

Sine, series, 88, 254 
hyperbolic, 471 
Singular, matrix 170, 175 
solution, 175 
Skew lines, 215 
Slope of curve, 21, 302 
Spherical polar coordinates, 15 
Spring system, 485 
Standard, deviation, 411 
error, 435 
integrals, 102 
series, 87 

Stationary point, 38 
Statistic, 434 

Statistics, (Ch. 15), 394, (Ch. 16), 433 

Steady field, 232 

Step function, 8 

Stiffness, 485 

Stirling, 366 

interpolation formula, 371, 372, 
373, 376, 378, 379 
Straight line, 201, 212 
Strain, 520 
tensor, 524 
Student f-test, 447 
Substitution rule for integrals, 119 
Suffix, notation, 147 
summation, 154, 167 
Sum-limit, 51, 128 
Sum to infinity, 51 
Symmetric matrix, 230 
Synthetic division, 311, 312, 315 
System of equations, 172 

/-DISTRIBUTION, 447 
Tangent, to curve, 21, 35 
inflexional, 37 
method, 338 

Taylor series, (Ch. 5), 79, 85, 89, 310, 
311, 314, 328 
Tension coefficient, 482 
Term by term, differentiation, 60, 61 
integration, 61 
Test, Chi-square, 452 
of convergence, 55 
of distribution, 452 
of independence, 454 
of significance, 433, 436, 440 
Student-/, 447 
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Third difference, 355 
Torsional problem, 299 
Total differentiation, 94 
Transformation, function, 264 
matrix, 148, 199, 227 
of derivative, 97 
Transposed matrix, 158 
Trapezoidal rule, 384, 387, 389, 390 
Trial, 395 

Triangle, inequalities, 190 
of addition, 186 
Triangular matrix, 173, 333 
Trigonometric functions, 262, 354 
Triple, scalar product, 217 
vector product, 219 
Trusses, 481 
Turning point, 36, 381 

Unit, cell, 497 
matrix, 156 
vector, 188 
Unsteady field, 232 
Upper triangular matrix, 173 

Variables, complex, 263 
dependent, 1 
function of three, 43, 94 
function of two, 16, 89 
independent, 1 
separable, 276, 517 


Variance, 411,412, 414, 418, 421,425 
Variation, 319 
Vector, (Ch. 9), 188, 496 
components, 192 
differentiation, 221, 224 
differential equation, 222 
equation of straight line, 201 
in plane, 224 
of amplitude, 487 
operator, V, 235 
product, 205, 209, 211, 217 
quantity, 188 
resolution, 192 
sum, difference, 188, 189 
Velocity, 225, 226 
Vibrational problem, 299, 484 
Voltage, lag, 504 
lead, 504 
response, 501 

Volume, by integration, 132 
element, 133 

Weddle’s rule, 386, 387, 389, 390 
Wien Bridge, 510 
Work, 206, 208 

Yates’ correction, 456 

Zero, matrix, 152 
vector, 188 
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